Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
GPT-4o
Gemini 2.5 Pro Pro
o3 Pro
GPT-4.1 Pro
DeepSeek R1 via Azure Pro
2000 character limit reached

Uncertainty in Ranking (2107.03459v4)

Published 7 Jul 2021 in stat.ME

Abstract: Ranks estimated from data are uncertain and this poses a challenge in many applications. However, estimated ranks are deterministic functions of estimated parameters, so the uncertainty in the ranks must be determined by the uncertainty in the parameter estimates. We give a complete characterization of this relationship in terms of the linear extensions of a partial order determined by interval estimates of the parameters of interest. We then use this relationship to give a set estimator for the overall ranking, use its size to measure the uncertainty in a ranking, and give efficient algorithms for several questions of interest. We show that our set estimator is a valid confidence set and describe its relationship to a joint confidence set for ranks recently proposed by Klein, Wright & Wieczorek. We apply our methods to both simulated and real data and make them available through the R package rankUncertainty.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (45)
  1. Computational Complexity: A Modern Approach. Cambridge University Press.
  2. magrittr: A Forward-Pipe Operator for R. R package version 1.5.
  3. Matrix: Sparse and Dense Matrix Classes and Methods. R package version 1.3-0.
  4. Bie, T. (2013). Confidence Intervals for Ranks: Theory and Applications in Binomial Data. Master’s thesis, Uppsala University.
  5. Graph Theory. Springer, 1st corrected edition.
  6. Counting Linear Extensions. Order, 8(3):225–242.
  7. Introduction to Algorithms. MIT Press, 3rd edition.
  8. xtable: Export Tables to LaTeX or HTML. R package version 1.8-4.
  9. Introduction to Lattices and Order. Cambridge University Press, 2nd edition.
  10. Rcpp: Seamless R and C++ Integration. Journal of Statistical Software, 40(8):1–18.
  11. Gallier, J. (2011). Discrete Mathematics. Springer.
  12. An Open Graph Visualization System and its Applications to Software Engineering. Software: Practice and Experience, 30(11):1203–1233.
  13. mvtnorm: Multivariate Normal and t Distributions. R package version 1.1-1.
  14. League Tables and Their Limitations: Statistical Issues in Comparisons of Institutional Performance. Journal of the Royal Statistical Society Series A, 159(3):385–443.
  15. Using the Bootstrap to Quantify the Authority of an Empirical Ranking. Annals of Statistics, 37(6B):3929–3959.
  16. Modeling the Variability of Rankings. The Annals of Statistics, 38(5):2652–2677.
  17. Holm, S. (2013). Confidence Intervals for Ranks. Working Paper 2013:10, Department of Statistics, Uppsala Universitet.
  18. Ranking Procedures for Several Normal Populations: An Empirical Investigation. International Journal of Statistical Sciences, 11:37–58.
  19. A Joint Confidence Region for an Overall Ranking of Populations. Journal of the Royal Statistical Society Series C, 69(3):589–606.
  20. Lai, R. (2020). arrangements: Fast Generators and Iterators for Permutations, Combinations, Integer Partitions and Compositions. R package version 1.1.9.
  21. Incorporating Natural Variation into IVF Clinic League Tables. Human Reproduction, 22(5):1359–1362.
  22. Louis, T. A. (1984). Estimating a Population of Parameter Values using Bayes and Empirical Bayes Methods. Journal of the American Statistical Association, 79(386):393–398.
  23. Probabilistic Preference Logic Networks. In Proceedings of the Twenty-first European Conference on Artificial Intelligence, pages 561–566.
  24. The Interval Inclusion Number of a Partially Ordered Set. Discrete Mathematics, 88(2-3):259–277.
  25. Global Partial Orders from Sequential Data. In Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 161–168.
  26. Reliability of League Tables of In Vitro Fertilisation Clinics: Retrospective Analysis of Live Birth Rates. British Medical Journal, 316:1701–1705.
  27. Mitas, J. (1994). Minimal Representation of Semiorders with Intervals of Same Length. In Orders, Algorithms and Applications, pages 162–175.
  28. Simultaneous Confidence Intervals for Ranks With Application to Ranking Institutions. arXiv:1812.05507.
  29. Convex Rank Tests and Semigraphoids. SIAM Journal on Discrete Mathematics, 23(3):1117–1134.
  30. Optimal Partial-Order Plan Relaxation via MaxSAT. Journal of Artificial Intelligence Research, 57:113–149.
  31. Structure Discovery in Bayesian Networks by Sampling Partial Orders. Journal of Machine Learning Research, 17(1):2002–2048.
  32. Peczarski, M. (2004). New Results in Minimum-Comparison Sorting. Algorithmica, 40(2):133–145.
  33. R Core Team (2020). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria.
  34. tikzDevice: R Graphics Output in LaTeX Format. R package version 0.12.3.1.
  35. Supporting Ranking Queries on Uncertain and Incomplete Data. The VLDB Journal, 19(4):477–501.
  36. Counting Linear Extensions in Practice: MCMC versus Exponential Monte Carlo. In Thirty-Second AAAI Conference on Artificial Intelligence.
  37. Trotter, W. T. (1997). New Perspectives on Interval Orders and Interval Graphs. London Mathematical Society Lecture Note Series, 241:237–286.
  38. Causal Discovery via MML. In Proceedings of the Thirteenth International Conference on Machine Learning, volume 96, pages 516–524.
  39. Wickham, H. (2007). Reshaping Data with the reshape Package. Journal of Statistical Software, 21(12):1–20.
  40. Wickham, H. (2016). ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York.
  41. Wieczorek, J. (2020). RankingProject: The Ranking Project: Visualizations for Comparing Populations. R package version 0.2.0.
  42. Ranking Populations Based on Sample Survey Data. Research Report Statistics 2014-07, Center for Statistical Research and Methodology, US Bureau of the Census, Washington DC.
  43. Confidence Intervals for Population Ranks in the Presence of Ties and Near Ties. Journal of the American Statistical Association, 104(486):775–788.
  44. Confidence Intervals for Ranks of Age-Adjusted Rates Across States or Counties. Statistics in Medicine, 33(11):1853–1866.
  45. Ranking Under Uncertainty. In Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence.
Citations (5)

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Follow-up Questions

We haven't generated follow-up questions for this paper yet.

Authors (1)