
Partial Rankings of Optimizers (2402.16565v3)

Published 26 Feb 2024 in cs.LG and stat.ML

Abstract: We introduce a framework for benchmarking optimizers according to multiple criteria over various test functions. Based on a recently introduced union-free generic depth function for partial orders/rankings, it fully exploits the ordinal information and allows for incomparability. Our method describes the distribution of all partial orders/rankings, avoiding the notorious shortcomings of aggregation. This makes it possible to identify test functions that produce central or outlying rankings of optimizers and to assess the quality of benchmarking suites.
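To illustrate how comparing optimizers on several criteria at once can yield a partial rather than a total ranking, here is a minimal sketch. It is not the paper's depth function; it only builds a component-wise dominance relation on a single test function and lists the pairs that remain incomparable. The optimizer names, the two criteria (final loss and runtime, lower is better), and all numbers are hypothetical.

```python
from itertools import permutations


def dominance_order(scores):
    """Return the pairs (a, b) where a is at least as good as b on every
    criterion and strictly better on at least one (lower is better)."""
    relation = set()
    for a, b in permutations(scores, 2):
        pa, pb = scores[a], scores[b]
        no_worse = all(x <= y for x, y in zip(pa, pb))
        strictly_better = any(x < y for x, y in zip(pa, pb))
        if no_worse and strictly_better:
            relation.add((a, b))
    return relation


def incomparable_pairs(scores, relation):
    """Return the unordered pairs that the dominance relation leaves unranked."""
    names = sorted(scores)
    return [
        (a, b)
        for i, a in enumerate(names)
        for b in names[i + 1:]
        if (a, b) not in relation and (b, a) not in relation
    ]


# Hypothetical results on one test function: (final loss, runtime in seconds).
scores = {
    "opt_A": (0.10, 120),
    "opt_B": (0.12, 150),
    "opt_C": (0.08, 300),
}

relation = dominance_order(scores)
print(sorted(relation))                      # dominance pairs
print(incomparable_pairs(scores, relation))  # pairs with no ranking
```

In this toy setup, opt_A dominates opt_B, while opt_C is incomparable to both because it reaches a lower loss but takes longer. Collapsing the two criteria into one score would force a ranking of those pairs, which is the kind of information loss the abstract attributes to aggregation.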



