Inconsistency among evaluation metrics in link prediction.

Authors

Bi Yilin, Jiao Xinshan, Lee Yan-Li, Zhou Tao

Affiliations

CompleX Lab, School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China.

School of Computer and Software Engineering, Xihua University, Chengdu 610039, China.

Publication information

PNAS Nexus. 2024 Nov 6;3(11):pgae498. doi: 10.1093/pnasnexus/pgae498. eCollection 2024 Nov.

DOI:10.1093/pnasnexus/pgae498

PMID:39564572

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC11574622/

Abstract

Link prediction is a paradigmatic and challenging problem in network science, which aims to predict missing links, future links, and temporal links based on known topology. As the number of link prediction algorithms grows, a critical yet previously ignored risk is that the evaluation metrics for algorithm performance are usually chosen arbitrarily. This paper reports extensive experiments on hundreds of real networks and 26 well-known algorithms, revealing significant inconsistency among evaluation metrics: different metrics may produce remarkably different rankings of algorithms. Therefore, we conclude that no single metric can comprehensively or credibly evaluate algorithm performance. In terms of information content, we suggest using at least two metrics: one is the area under the receiver operating characteristic curve, and the other is one of the following three candidates: the area under the precision-recall curve, the area under the precision curve, or the normalized discounted cumulative gain. When the data are imbalanced, that is, when the number of negative samples significantly outweighs the number of positive samples, the area under the generalized receiver operating characteristic curve should also be used. In addition, because we have proved the essential equivalence of threshold-dependent metrics, if some specific thresholds are meaningful in a link prediction task, any one threshold-dependent metric can be used with those thresholds. This work fills a missing part in the landscape of link prediction and provides a starting point toward a well-accepted criterion or standard for selecting proper evaluation metrics for link prediction.
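The kind of inconsistency described in the abstract can be illustrated with a toy case. The sketch below is not the paper's experimental code: the ten candidate node pairs, the two score vectors (`scores_a`, `scores_b`), and the helper names are all hypothetical, and average precision is used here only as a common stand-in for the area under the precision-recall curve, which the paper may define differently.

```python
import math

# Toy data (assumed for illustration): 10 candidate node pairs,
# the first two are true missing links (label 1), the rest are non-links.
labels = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]

# Hypothetical scores from two algorithms.
# A ranks one non-link on top, then both true links.
scores_a = [0.8, 0.7, 0.9, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.0]
# B ranks one true link first and the other one last.
scores_b = [1.0, 0.0, 0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2]


def auc(scores, labels):
    """Probability that a random positive outscores a random negative."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def ranked_labels(scores, labels):
    """Labels ordered by descending score."""
    return [y for _, y in sorted(zip(scores, labels), key=lambda t: -t[0])]


def average_precision(scores, labels):
    """Mean of the precision values at the rank of each positive."""
    hits, total = 0, 0.0
    for rank, y in enumerate(ranked_labels(scores, labels), start=1):
        if y == 1:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


def ndcg(scores, labels):
    """Normalized discounted cumulative gain with binary relevance."""
    ranked = ranked_labels(scores, labels)
    dcg = sum(y / math.log2(r + 1) for r, y in enumerate(ranked, start=1))
    ideal = sum(1 / math.log2(r + 1) for r in range(1, sum(labels) + 1))
    return dcg / ideal


for name, scores in (("A", scores_a), ("B", scores_b)):
    print(name,
          round(auc(scores, labels), 3),
          round(average_precision(scores, labels), 3),
          round(ndcg(scores, labels), 3))
```

On this toy input, AUC prefers algorithm A (0.875 versus 0.5), whereas average precision and NDCG both prefer algorithm B, so the choice of metric alone flips the ranking of the two algorithms.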

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90d6/11574622/338bd38a6524/pgae498f1.jpg
