
Effective Individual Fairest Community Search over Heterogeneous Information Networks (2404.12107v1)

Published 18 Apr 2024 in cs.DB, cs.DS, and cs.SI

Abstract: Community search over heterogeneous information networks (HINs) has been applied in many domains, such as activity organization and team formation. In these scenarios, members of a group who receive the same treatment often differ in activity level and workload, which makes the treatment unfair between active and inactive members (called individual unfairness). However, existing works neither address individual fairness nor sufficiently consider the rich semantics of HINs (e.g., high-order structure), which prevents them from supporting complex queries. To fill this gap, we formally define the problem of individual fairest community search over HINs (denoted IFCS), which aims to find a set of vertices from the HIN that share the same type, are closely related, and differ little in activity level; the problem has been demonstrated to be NP-hard. To solve it, we first develop an exploration-based filter that effectively reduces the search space of the community. Further, to avoid repeated computation and prune unfair communities in advance, we propose a message-based scheme and a lower bound-based scheme. Finally, we conduct extensive experiments on four real-world datasets to demonstrate the effectiveness and efficiency of our proposed algorithms, which run at least 3 times faster than the baseline solution.
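
The abstract does not state how the "difference of activity level" within a community is measured. As a rough illustration only, the sketch below assumes the Gini coefficient, a standard inequality measure, as a stand-in; the function name `gini` and the sample activity values are hypothetical and are not taken from the paper.

```python
def gini(values):
    """Gini coefficient of non-negative values; 0.0 means all values are equal."""
    xs = sorted(values)
    n = len(xs)
    total = sum(xs)
    if n == 0 or total == 0:
        # Empty or all-zero input: treat as perfectly equal (an assumption).
        return 0.0
    # Closed form on sorted data: sum_i (2i - n - 1) * x_i / (n * total), i = 1..n
    weighted = sum((2 * i - n - 1) * x for i, x in enumerate(xs, start=1))
    return weighted / (n * total)


# Hypothetical activity levels of two candidate communities.
balanced = [5, 5, 5, 5]   # every member equally active
skewed = [0, 0, 0, 20]    # one member does all the work

print(gini(balanced))  # 0.0
print(gini(skewed))    # 0.75
```

Under this assumed measure, a lower score indicates a community whose members have more similar activity levels.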

