
Unsupervised Estimation of Ensemble Accuracy (2311.10940v2)

Published 18 Nov 2023 in cs.AI

Abstract: Ensemble learning combines several individual models to obtain better generalization performance. In this work we present a practical method for estimating the joint power of several classifiers. Unlike existing approaches, which focus on "diversity" measures, it does not rely on labels. This makes it both accurate and practical in the modern setting of unsupervised learning with huge datasets. The heart of the method is a combinatorial bound on the number of mistakes the ensemble is likely to make. The bound can be efficiently approximated in time linear in the number of samples. We relate the bound to actual misclassifications, which makes it useful as a predictor of performance. We demonstrate the method on popular large-scale face recognition datasets, which provide a useful playground for fine-grained classification tasks using noisy data over many classes.
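The abstract does not spell out the bound itself, so the following is only an illustrative sketch of the general setting it describes: label-free statistics computed from classifier outputs in a single pass over the samples. It is not the paper's method. The function name `agreement_stats` and the input layout are assumptions made for this example. The one fact it relies on is that whenever two classifiers disagree on a sample, at least one of them is wrong, so the count of disputed samples is a lower bound on the total number of individual mistakes.

```python
from collections import Counter
from typing import Dict, Hashable, Sequence


def agreement_stats(predictions: Sequence[Sequence[Hashable]]) -> Dict[str, int]:
    """Label-free agreement counts for an ensemble (illustrative, hypothetical helper).

    predictions[i][j] is the label predicted by classifier i for sample j.
    No ground-truth labels are used. Runs in time linear in the number of
    samples for a fixed number of classifiers.
    """
    n_classifiers = len(predictions)
    n_samples = len(predictions[0])
    disputed = 0            # samples where the classifiers are not unanimous
    no_strict_majority = 0  # samples where no label gets more than half the votes

    for j in range(n_samples):
        votes = Counter(p[j] for p in predictions)
        top_count = votes.most_common(1)[0][1]
        if top_count < n_classifiers:
            # At least one classifier must be wrong on this sample.
            disputed += 1
        if 2 * top_count <= n_classifiers:
            no_strict_majority += 1

    return {
        "n_samples": n_samples,
        "disputed": disputed,
        "no_strict_majority": no_strict_majority,
    }


if __name__ == "__main__":
    # Three hypothetical classifiers, four samples.
    preds = [
        [0, 1, 2, 3],
        [0, 1, 2, 0],
        [0, 1, 5, 1],
    ]
    print(agreement_stats(preds))
```

Counts like these only describe how often the classifiers agree. Turning them into an estimate of ensemble accuracy is the contribution of the paper and is not reproduced here.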

