Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Active learning with biased non-response to label requests (2312.08150v2)

Published 13 Dec 2023 in cs.LG, stat.ME, and stat.ML

Abstract: Active learning can improve the efficiency of training prediction models by identifying the most informative new labels to acquire. However, non-response to label requests can impact active learning's effectiveness in real-world contexts. We conceptualise this degradation by considering the type of non-response present in the data, demonstrating that biased non-response is particularly detrimental to model performance. We argue that biased non-response is likely in contexts where the labelling process, by nature, relies on user interactions. To mitigate the impact of biased non-response, we propose a cost-based correction to the sampling strategy--the Upper Confidence Bound of the Expected Utility (UCB-EU)--that can, plausibly, be applied to any active learning algorithm. Through experiments, we demonstrate that our method successfully reduces the harm from labelling non-response in many settings. However, we also characterise settings where the non-response bias in the annotations remains detrimental under UCB-EU for specific sampling methods and data generating processes. Finally, we evaluate our method on a real-world dataset from an e-commerce platform. We show that UCB-EU yields substantial performance improvements to conversion models that are trained on clicked impressions. Most generally, this research serves to both better conceptualise the interplay between types of non-response and model improvements via active learning, and to provide a practical, easy-to-implement correction that mitigates model degradation.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (36)
  1. Audibert JY, Bubeck S, Munos R (2010) Best arm identification in multi-armed bandits. In: COLT, pp 41–53 Barbieri et al [2016] Barbieri N, Silvestri F, Lalmas M (2016) Improving post-click user engagement on native ads via survival analysis. In: Proceedings of the 25th International Conference on World Wide Web, pp 761–770 Bartók et al [2014] Bartók G, Foster DP, Pál D, et al (2014) Partial monitoring—classification, regret bounds, and algorithms. Mathematics of Operations Research 39(4):967–997 Carcillo et al [2018] Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Barbieri N, Silvestri F, Lalmas M (2016) Improving post-click user engagement on native ads via survival analysis. In: Proceedings of the 25th International Conference on World Wide Web, pp 761–770 Bartók et al [2014] Bartók G, Foster DP, Pál D, et al (2014) Partial monitoring—classification, regret bounds, and algorithms. Mathematics of Operations Research 39(4):967–997 Carcillo et al [2018] Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Bartók G, Foster DP, Pál D, et al (2014) Partial monitoring—classification, regret bounds, and algorithms. Mathematics of Operations Research 39(4):967–997 Carcillo et al [2018] Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  2. Barbieri N, Silvestri F, Lalmas M (2016) Improving post-click user engagement on native ads via survival analysis. In: Proceedings of the 25th International Conference on World Wide Web, pp 761–770 Bartók et al [2014] Bartók G, Foster DP, Pál D, et al (2014) Partial monitoring—classification, regret bounds, and algorithms. Mathematics of Operations Research 39(4):967–997 Carcillo et al [2018] Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Bartók G, Foster DP, Pál D, et al (2014) Partial monitoring—classification, regret bounds, and algorithms. Mathematics of Operations Research 39(4):967–997 Carcillo et al [2018] Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  3. Bartók G, Foster DP, Pál D, et al (2014) Partial monitoring—classification, regret bounds, and algorithms. Mathematics of Operations Research 39(4):967–997 Carcillo et al [2018] Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  4. Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  5. Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  6. Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  7. Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  8. Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  9. Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  10. Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  11. Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  12. Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  13. King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  14. Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  15. Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  16. Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  17. Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  18. Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  19. Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  20. Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  21. McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  22. Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  23. Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  24. Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  25. Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  26. Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  27. Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  28. Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  29. Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  30. Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  31. Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  32. Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  33. Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  34. Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  35. Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
  36. Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733

Summary

  • The paper presents the UCB-EU method, integrating response probability into sampling strategies to counteract biased non-response.
  • It reveals how non-response effects—both volume and imbalance—can substantially hinder model training if left unmanaged.
  • Empirical results, including an e-commerce case study, demonstrate that addressing bias improves post-click model accuracy.

Introduction

Active Learning (AL) is an area of research in machine learning that focuses on improving the learning model by selectively acquiring new data points that are most useful to the model's learning process. Conventionally, AL is premised on the assumption that every request for the label of an unlabeled example will be met with a response—something that doesn't always align with real-world scenarios. Non-responses can be prevalent, especially when the data labeling process involves human interaction or is derived from user interactions with a platform, leading to potential biases that undermine AL's efficiency.

This paper addresses the challenges posed by biased non-response in the active learning context. The authors propose a method to adjust AL processes to account for potential non-response, presenting both theoretical analysis and empirical validations, including a case paper using a real-world e-commerce dataset.

Impact of Non-Response

It is argued that biased non-response can significantly impair a model's performance. Two kinds of impacts are theorized: a "volume effect," where the number of training samples is reduced, and an "imbalance effect," where the training examples' distribution becomes skewed. These effects can create a feedback loop that either stalls or deteriorates model improvement over time.

The paper introduces two forms of non-responses: Missing Completely at Random (MCAR), where non-response is distributed uniformly and does not relate to the characteristics of the data, and Missing at Random (MAR), where non-response is associated with observed features in the data. Dealing with MAR situations proves more challenging, particularly when high-value information coincides with high non-response rates, presenting significant hurdles for AL effectiveness.

Methodology for Addressing Non-Response

The authors propose an algorithmic correction termed the Upper Confidence Bound of the Expected Utility (UCB-EU), which adjusts the sampling strategy to consider the estimated probability of obtaining a response. This approach is designed to be compatible with various AL querying strategies and integrates an element of the estimated probability of label acquisition to avoid allocating resources to regions of the dataset where non-responses dominate.

Experimental Validation

The proposed method is subjected to experiments using synthetic data scenarios and a case paper involving the Taobao e-commerce platform. The simulations demonstrate that the UCB-EU correction can improve AL model performance, particularly in contexts where non-response is prevalent and biased. In the Taobao case paper, integrating UCB-EU leads to significant improvements in the post-click model, effectively handling the non-response biases generated by user interactions.

Conclusion

The presence of biased non-response in active learning can substantially impact the effectiveness of model improvement efforts. The UCB-EU correction presented in this paper provides a practical solution to this problem, facilitating more reliable and efficient model training in contexts where non-response biases are likely to occur. By merging the principles of expected utility with the realities of data acquisition challenges, this research bridges a significant gap between theoretical assumptions in AL and the messy complexities of real-world applications.

X Twitter Logo Streamline Icon: https://streamlinehq.com