Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Comparing discriminating abilities of evaluation metrics in link prediction (2401.03673v1)

Published 8 Jan 2024 in cs.SI and physics.data-an

Abstract: Link prediction aims to predict the potential existence of links between two unconnected nodes within a network based on the known topological characteristics. Evaluation metrics are used to assess the effectiveness of algorithms in link prediction. The discriminating ability of these evaluation metrics is vitally important for accurately evaluating link prediction algorithms. In this study, we propose an artificial network model, based on which one can adjust a single parameter to monotonically and continuously turn the prediction accuracy of the specifically designed link prediction algorithm. Building upon this foundation, we show a framework to depict the effectiveness of evaluating metrics by focusing on their discriminating ability. Specifically, a quantitative comparison in the abilities of correctly discerning varying prediction accuracies was conducted encompassing nine evaluation metrics: Precision, Recall, F1-Measure, Matthews Correlation Coefficient (MCC), Balanced Precision (BP), the Area Under the receiver operating characteristic Curve (AUC), the Area Under the Precision-Recall curve (AUPR), Normalized Discounted Cumulative Gain (NDCG), and the Area Under the magnified ROC (AUC-mROC). The results indicate that the discriminating abilities of the three metrics, AUC, AUPR, and NDCG, are significantly higher than those of other metrics.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (30)
  1. Lü L and Zhou T 2011 Link prediction in complex networks: A survey Physica A: Statistical Mechanics and its Applications 390 1150-1170
  2. Liben-Nowell D and Kleinberg J 2007 The link-prediction problem for social networks Journal of the Association for Information Science and Technology 58 1019-1031
  3. Clauset A, Moore C and Newman M E J 2008 Hierarchical structure and the prediction of missing links in networks Nature 453 98–101
  4. Guimerà R and Sales-Pardo M 2009 Missing and spurious interactions and the reconstruction of complex networks PNAS 106 22073-22078
  5. Leskovec J, Huttenlocher D and Kleinberg J 2010 Predicting positive and negative links in online social networks Proceedings of the 19th International Conference on World Wide Web (WWW’10) (Association for Computing Machinery) pp 641–650
  6. Huang Z and Lin D K J 2009 The time-series link prediction problem with applications in communication surveillance INFORMS Journal on Computing 21 286–303
  7. Tang J, Wu S and Sun J 2013 Confluence: Conformity influence in large social networks Proceedings of the 19th ACM SIGKDD International Conference on Knowllink Discovery and Data Mining (KDD’13) (Association for Computing Machinery) pp 347–355
  8. Adamic L A and Adar E 2003 Friends and neighbors on the web Social Networks 25 211–230
  9. Yu H et al 2008 High-quality binary protein interaction map of the yeast interactome network Science 322 104–110
  10. Sulaimany S, Khansari M and Masoudi-Nejad A 2018 Link prediction potentials for biological networks International Journal of Data Mining and Bioinformatics 20 161–184
  11. Lei C and Ruan J 2013 A novel link prediction algorithm for reconstructing protein–protein interaction networks by topological similarity Bioinformatics 29 355-364
  12. Barabasi A L and Oltvai Z N 2004 Network biology: understanding the cell’s functional organization Nature Reviews Genetics 5 101–113
  13. Zhou T 2021 Progresses and challenges in link prediction iScience 24 103217
  14. Zhou T, Lü L and Zhang Y C 2009 Predicting missing links via local information The European Physical Journal B 71 623–630
  15. Lichtenwalter R N, Lussier J T and Chawla N V 2010 New perspectives and methods in link prediction Proceedings of the 16th ACM SIGKDD International Conference on Knowllink Discovery and Data Mining (KDD’10) (Association for Computing Machinery) pp 243–252
  16. Saito T and Rehmsmeier M 2015 The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets PLoS ONE 10 e0118432
  17. Hand D J and Till R J 2001 A simple generalisation of the area under the ROC curve for multiple class classification problems Machine Learning 45 171–186
  18. Yang Y, Lichtenwalter R N and Chawla N V 2015 Evaluating link prediction methods Knowllink and Information Systems 45 751–782
  19. Austin M 2007 Species distribution models and ecological theory: a critical assessment and some possible new approaches Ecological Modelling 200 1–19
  20. Lobo J M, Jiménez-Valverde A and Real R 2008 AUC: a misleading measure of the performance of predictive distribution models Global Ecology and Biogeography 17 145–151
  21. Muscoloni A and Cannistraci C V 2022 Early retrieval problem and link prediction evaluation via the area under the magnified ROC (Preprints: 202209.0277)
  22. Del Genio C I, Gross T and Bassler K E 2011 All scale-free networks are sparse Physical Review Letters 107 178701
  23. Zhou T 2023 Discriminating abilities of threshold-free evaluation metrics in link prediction Physica A: Statistical Mechanics and its Applications 615 128529
  24. Newman M E J 2003 The structure and function of complex networks SIAM Review 45 167–256
  25. Buckland M and Gey F 1994 The relationship between Precision and Recall Journal of the Association for Information Science and Technology 45 12–19
  26. Sasaki Y 2007 The truth of the F-measure Teach Tutor Mater 1 1–5
  27. Matthews B W 1975 Comparison of the predicted and observed secondary structure of T4 phage lysozyme Biochimica et Biophysica Acta (BBA) - Protein Structure 405 442–451
  28. Järvelin K and Kekäläinen J 2002 Cumulated gain-based evaluation of IR techniques ACM Transactions on Information Systems (TOIS) 20 422–446
  29. Hanley J A and McNeil B J 1982 The meaning and use of the area under a receiver operating characteristic (ROC) curve Radiology 143 29–36
  30. Davis J and Goadrich M 2006 The relationship between precision-recall and ROC curves Proceedings of the 23rd International Conference on Machine Learning (ICML’06) (Association for Computing Machinery) pp 233–240
Citations (4)

Summary

We haven't generated a summary for this paper yet.