
Comparing Machine Learning Algorithms by Union-Free Generic Depth (2312.12839v3)

Published 20 Dec 2023 in cs.LG and stat.ML

Abstract: We propose a framework for descriptively analyzing sets of partial orders based on the concept of depth functions. Despite intensive study of depth functions in linear and metric spaces, there is very little discussion of depth functions for non-standard data types such as partial orders. We introduce an adaptation of the well-known simplicial depth to the set of all partial orders, the union-free generic (ufg) depth. Moreover, we use our ufg depth to compare machine learning algorithms based on multidimensional performance measures. Concretely, we provide two examples of classifier comparisons on samples of standard benchmark data sets. Our results promisingly demonstrate the wide variety of analysis approaches that ufg methods make possible. Furthermore, the examples show that our approach differs substantially from existing benchmarking approaches, and thus adds a new perspective to the lively debate on classifier comparison.
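
For readers unfamiliar with the simplicial depth that the abstract refers to, the following is a minimal sketch of the classical empirical version for points in the plane: the depth of a point is the fraction of triangles spanned by three sample points that contain it. This is not the ufg depth defined in the paper, which works on partial orders; the function names (`in_triangle`, `simplicial_depth`) and the toy sample are illustrative assumptions, and the sketch assumes sample points in general position.

```python
from itertools import combinations


def _orient(a, b, c):
    """Signed area test: > 0 if a, b, c turn counter-clockwise, < 0 if clockwise."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])


def in_triangle(p, a, b, c):
    """Return True if p lies in the closed triangle with vertices a, b, c.

    Assumes a, b, c are not collinear (degenerate triangles are not handled).
    """
    d1, d2, d3 = _orient(a, b, p), _orient(b, c, p), _orient(c, a, p)
    has_neg = d1 < 0 or d2 < 0 or d3 < 0
    has_pos = d1 > 0 or d2 > 0 or d3 > 0
    # p is inside (or on the boundary) when the three signs do not disagree.
    return not (has_neg and has_pos)


def simplicial_depth(p, sample):
    """Empirical simplicial depth of p in the plane.

    Counts the share of triangles formed by three sample points that contain p.
    """
    triangles = list(combinations(sample, 3))
    if not triangles:
        raise ValueError("need at least three sample points")
    return sum(in_triangle(p, *t) for t in triangles) / len(triangles)


# Hypothetical toy sample: the four corners of a square.
sample = [(0, 0), (4, 0), (4, 4), (0, 4)]
inner = simplicial_depth((1, 2), sample)     # point inside the square
outer = simplicial_depth((10, 10), sample)   # point far outside the square
print(inner, outer)
```

Points near the middle of the data cloud are covered by many triangles and receive a high depth, while outlying points receive a depth near zero. The abstract describes the ufg depth as an adaptation of this idea to the set of all partial orders.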

