Kybernetika 58 no. 2, 277-300, 2022

A new overlapping community detection algorithm based on similarity of neighbors in complex networks

Pelin Çetin and Sahin Emrah AmrahovDOI: 10.14736/kyb-2022-2-0277

Abstract:

Community detection algorithms help us improve the management of complex networks and provide a clean sight of them. We can encounter complex networks in various fields such as social media, bioinformatics, recommendation systems, and search engines. As the definition of the community changes based on the problem considered, there is no algorithm that works universally for all kinds of data and network structures. Communities can be disjointed such that each member is in at most one community or overlapping such that every member is in at least one community. In this study, we examine the problem of finding overlapping communities in complex networks and propose a new algorithm based on the similarity of neighbors. This algorithm runs in $ O(m \textit{lg} m) $ running time in the complex network containing $ m $ number of relationships. To compare our algorithm with existing ones, we select the most successful four algorithms from the Community Detection library (CDlib) by eliminating the algorithms that require prior knowledge, are unstable, and are time-consuming. We evaluate the successes of the proposed algorithm and the selected algorithms using various known metrics such as modularity, F-score, and Normalized Mutual Information. In addition, we adapt the coverage metric defined for disjoint communities to overlapping communities and also make comparisons with this metric. We also test all of the algorithms on small graphs of real communities. The experimental results show that the proposed algorithm is successful in finding overlapping communities.

Keywords:

complex networks, overlapping community detection, graph approach, similarity approach, community metrics

Classification:

94C15

References:

  1. S. Aghaalizadeh, S. T. Afshord, A. Bouyer and B. Anari: A three-stage algorithm for local community detection based on the high node importance ranking in social networks. Phys. A 563 (2021), 125420.   DOI:10.1016/j.physa.2020.125420
  2. S. Agrawal and A. Patel: SAG cluster: An unsupervised graph clustering based on collaborative similarity for community detection in complex networks. Phys. A 563 (2021), 125459.   DOI:10.1016/j.physa.2020.125459
  3. Y. Y. Ahn, J. P. Bagrow and S. Lehmann: Link communities reveal multiscale complexity in networks. Nature 466 (2010), 761-764.   DOI:10.1038/nature09182
  4. M. Arab and M. Hasheminezhad: Limitations of quality metrics for community detection and evaluation. In: 3th International Conference on Web Research (ICWR), Tehran 2017.   CrossRef
  5. J. P. Attal, M. Malek and M. Zolghadri: Overlapping community detection using core label propagation algorithm and belonging functions. Appl. Intell. 51 (2021), 8067-8087.   DOI:10.1007/s10489-021-02250-4
  6. CDlib: Community Discovery Library.    CrossRef
  7. T. Chakraborty, A. Dalmia, A. Mukherjee and N. Ganguly: Metrics for community analysis: A survey. ACM Computing Surveys 50 (2018), 1-37.   DOI:10.1145/3139222
  8. D. Chen, M. Shang, Z. Lv and Y. Fu: Detecting overlapping communities of weighted networks via a local algorithm. Phys. A 389 (2010), 4177-4187.   DOI:10.1016/j.physa.2010.05.046
  9. A. Choumane, A. Awada and A. Harkous: Core expansion: a new community detection algorithm based on neighborhood overlap. Soc. Netw. Anal. Min. 10 (2020), 30.   DOI:10.1007/s13278-020-00647-6
  10. M. Coscia, G. Rossetti, F. Giannotti and D. Pedreschi: DEMON: A local-first discovery method for overlapping communities. In: Proc. 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Beijing 2012.   CrossRef
  11. Ş. Emrah Amrahov and B. Tugrul: A community detection algorithm on graph data. In: 2018 International Conference on Artificial Intelligence and Data Processing (IDAP), Malatya 2018.   CrossRef
  12. I. Farkas, D. Ábel, G. Palla and T. Vicsek: Weighted network modules. New J. Phys. 9 (2007), 180.   DOI:10.1177/1461444807072228
  13. L. Feng, Q. Zhao and C. Zhou: Incorporating affiliation preference into overlapping community detection. Phys. A 563 (2021), 125429.   DOI:10.1016/j.physa.2020.125429
  14. S. Fortunato: Community detection in graphs. Phys. Rep. 486 (2009), 75-174.   DOI:10.1016/j.physrep.2009.11.002
  15. Y. Gao, X. Yu and H. Zhang: Overlapping community detection by constrained personalized pagerank. Expert Syst. Appl. 173 (2021), 114682.   DOI:10.1016/j.eswa.2021.114682
  16. J. Ge, H. Sun, C. Xue, L. He, X. Jia, H. He and J. Chen: LPX: Overlapping community detection based on X-means and label propagation algorithm in attributed networks. Comput. Intell. 37 (2021), 484-510.   DOI:10.1111/coin.12420
  17. P. K. Gopalan and D. M. Blei: Efficient discovery of overlapping communities in massive networks. Proc. Natl. Acad. Sci. USA 110 (2013), 14534-14539.   DOI:10.1073/pnas.1221839110
  18. S. Goswami and A. K. Das: Determining maximum cliques for community detection in weighted sparse networks. Knowl. Inf. Syst. 64 (2022), 289-324.   DOI:10.1007/s10115-021-01631-y
  19. S. Gregory: An algorithm to find overlapping community structure in networks. In: Proc. 11th European Conference on Principles and Practice of Knowledge Discovery in Databases, Warsaw 2007.   CrossRef
  20. S. Gregory: A fast algorithm to find overlapping communities in networks. In: Machine Learning and Knowledge Discovery in Databases, Berlin 2008.   CrossRef
  21. S. Gregory: Finding overlapping communities in networks by label propagation. New J. Phys. 12 (2010), 103018.   DOI:10.1088/1367-2630/12/10/103018
  22. K. Guo, Q. Wang, J. Lin, L. Wu, W. Guo and K.-M. Chao: Network representation learning based on community-aware and adaptive random walk for overlapping community detection. Appl. Intell. 52 (2022), 9919-9937.   DOI:10.1007/s10489-021-02999-8
  23. S. Gupta and P. Kumar: An overlapping community detection algorithm based on rough clustering of links. Data Knowl. Engrg. 125 (2020), 101777.   DOI:10.1016/j.datak.2019.101777
  24. C. He, H. Liu, Y. Tang, S. Liu, X. Fei, Q. Cheng and H. Li: Similarity preserving overlapping community detection in signed networks. Future Gener. Comput. Syst. 116 (2021), 275-290.   DOI:10.1016/j.future.2020.10.034
  25. D. Jin, B. Gabrys and J. Dang: Combined node and link partitions method for finding overlapping communities in complex networks. Sci. Rep. 5 (2015), 8600.   DOI:10.1038/srep08600
  26. H. S. Joghan, A. Bagheri and M. Azad: Weighted label propagation based on local edge betweenness. J. Supercomput. 75 (2019), 8094-8114.   DOI:10.1007/s11227-019-02978-4
  27. J. Kim, S. Lim, J. G. Lee and B. S. Lee: LinkBlackHole*: Robust overlapping community detection using link embedding. IEEE Trans. Knowl. Data Engrg. 31 (2019), 2138-2150.   DOI:10.1109/TKDE.2018.2873750
  28. I. B. E. Kouni, W. Karoui and L. B. Romdhane: Node importance based label propagation algorithm for overlapping community detection in networks. Expert Syst. Appl. 162 (2020), 113020.   DOI:10.1016/j.eswa.2019.113020
  29. A. Lancichinetti, S. Fortunato and J. Kertesz: Detecting the overlapping and hierarchical community structure in complex networks. New J. Phys. 11 (2009), 033015.   DOI:10.1088/1367-2630/11/3/033015
  30. C. Lee, F. Reid, A. McDaid and N. Hurley: Detecting highly overlapping community structure by greedy clique expansion.    CrossRef
  31. C. Li, H. Chen, T. Li and X. Yang: A stable community detection approach for complex network based on density peak clustering and label propagation. Appl. Intell. 52 (2021), 1188-1208.   DOI:10.1007/s10489-021-02287-5
  32. H. V. Lierde, G.S. Member and T. W. S. Chow: Scalable spectral clustering for overlapping community detection in large-scale networks. IEEE Trans. Knowl. Data. Engrg. 32 (2020), 754-767.   DOI:10.1109/TKDE.2019.2892096
  33. S. Lim, S. Ryu, S. Kwon, K. Jung and J. G. Lee: LinkSCAN: Overlapping community detection using the link-space transformation. In: IEEE 30th International Conference on Data Engineering, Chicago 2014.   CrossRef
  34. H. Long: Overlapping community detection with least replicas in complex networks. Inform. Sci. 453 (2018), 216-226.   DOI:10.1016/j.ins.2018.03.063
  35. H. Lu, Z. Zhang, Z. Qu and Y. Kang: LPANNI: Overlapping community detection using label propagation in large-scale complex networks. IEEE Trans. Knowl. Data. Engrg. 31 (2019), 1736-1749.   DOI:10.1109/TKDE.2018.2866424
  36. T. Ma, Q. Liu, J. Cao, Y. Tian, A. Al-Dhelaan and M. Al-Rodhaan: LGIEM: Global and local node influence based community detection. Future Gener. Comput. Syst. 105 (2020), 533-546.   DOI:10.1016/j.future.2019.12.022
  37. T. Ma, Y. Wang, M. Tang, J. Cao, Y. Tian, A. Al-Dhelaan and M. Al-Rodhaan: LED: A fast overlapping communities detection algorithm based on structural clustering. Neurocomputing 207 (2016), 488-500.   DOI:10.1016/j.neucom.2016.05.020
  38. A. Mahabadi and M. Hosseini: SLPA-based parallel overlapping community detection approach in large complex social networks. Multimed. Tools. Appl. 80 (2021), 6567-6598.   DOI:10.1007/s11042-020-09993-1
  39. A. D. McCarthy, T. Chen, R. Rudinger and D. W. Matula: Metrics matter in community detection. In: Complex Networks and Their Applications VIII (H. Cherifi, S. Gaito, J. Mendes, E. Moro, L. Rocha, eds.), Stud. Comput. Intell., 881, Springer 2019, pp. 164-175.   DOI:10.1007/978-3-030-36687-2\_14
  40. A. McDaid, D. Greene and N. Hurley: Normalized mutual information to evaluate overlapping community finding algorithms.    CrossRef
  41. T. Nepusz, A. Petróczi, L. Négyessy and F. Bazsó: Fuzzy communities and the concept of bridgeness in complex networks. Phys. Rev. E 77 (2008), 1-12.   DOI:10.1103/physreve.77.016107
  42. M. E. J. Newman and M. Girvan: Finding and evaluating community structure in networks. Phys. Rev. E 69 (2004), 1-15.   DOI:10.1103/physreve.69.026113
  43. G. Palla, I. Derényi, I. Farkas and T. Vicsek: Uncovering the overlapping community structure of complex networks in nature and society. Nature 435 (2005), 814-818.   DOI:10.1038/nature03607
  44. B. Pattabiraman, M. A. Patwary, A. H. Gebremedhin, W. K. Liao and A. Choudhary: Fast algorithms for the maximum clique problem on massive graphs with applications to overlapping community detection. Internet Math. 11 (2015), 421-448.   DOI:10.1080/15427951.2014.986778
  45. H. S. Pattanayak, H. K. Verma and A. L. Sangal: Community detection metrics and algorithms in social networks. In: First International Conference on Secure Cyber Computing and Communication (ICSCCC), Jalandhar 2018, pp. 483-489.   DOI:10.1109/icsccc.2018.8703215
  46. I. Psorakis, S. Roberts, M. Ebden and B. Sheldon: Overlapping community detection using Bayesian non-negative matrix factorization. Phys. Rev. E 83 (2011), 066114.   DOI:10.1103/physreve.83.066114
  47. M. Qin and K. Lei: Dual-channel hybrid community detection in attributed networks. Inform. Sci. 551 (2021), 146-167.   DOI:10.1016/j.ins.2020.11.010
  48. H. Shen, X. Cheng, K. Cai and M-B. Hu: Detect overlapping and hierarchical community structure in networks. Phys. A 388 (2009), 1706-1712.   DOI:10.1016/j.physa.2008.12.021
  49. Z. Sun, B. Wang, J. Sheng, Z. Yu and J. Shao: Overlapping community detection based on information dynamics. IEEE Access 6 (2018), 70919-70934.   DOI:10.1109/ACCESS.2018.2879648
  50. X. Wang, G. Liu and J. Li: Overlapping community detection based on structural centrality in complex networks. IEEE Access 5 (2017), 25258-25269.   DOI:10.1109/ACCESS.2017.2769484
  51. Z. X. Wang, Z. C. Li, X. F. Ding and J. H. Tang: Overlapping community detection based on node location analysis. Knowl. Based Syst. 105 (2016), 225-235.   DOI:10.1016/j.knosys.2016.05.024
  52. X. Wen, W. N. Chen, Y. Lin, T. Gu, H. Zhang, Y. Li, Y. Yin and J. Zhang: A maximal clique based multiobjective evolutionary algorithm for overlapping community detection. IEEE Trans. Evol. Comput. 21 (2017), 363-377.   DOI:10.1109/TEVC.2016.2605501
  53. J. J. Whang, F. David and I. S. Dhillon: Overlapping community detection using neighborhood-inflated seed expansion. IEEE Trans. Knowl. Data. Engrg. 28 (2016), 1272-1284.   DOI:10.1109/TKDE.2016.2518687
  54. Q. Wu, R. Chen, L. Wang and K. Guo: A label propagation algorithm for community detection on high-mixed networks. Concurr. Comput. Pract. Exp. 33 (2021).   DOI:10.1002/cpe.6141
  55. J. Xie, B. K. Szymanski and X. Liu: SLPA: Uncovering overlapping communities in social networks via a speaker-listener interaction dynamic process. In: IEEE 11th International Conference on Data Mining Workshops, Vancouver 2011, pp. 344-349.   DOI:10.1109/icdmw.2011.154
  56. H. Yu, R. Ma, J. Chao and F. Zhang: An overlapping community detection approach based on DeepWalk and improved label propagation. IEEE Trans. Comput. Soc. Syst. (2022), 1-11.   DOI:10.1109/tcss.2022.3152579
  57. S. Zhang, R. S. Wang and X. S. Zhang: Uncovering fuzzy community structure in complex networks. Phys. Rev. E 76 (2007), 046103.   DOI:10.1103/physreve.76.046103
  58. Y. Zhang and D. Y. Yeung: Overlapping community detection via bounded nonnegative matrix tri-factorization. In: Proc. 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Beijing 2012, pp. 606-614.   DOI:10.1145/2339530.2339629
  59. Q. Zhou, S. Cai and Y. Zhang: Detecting overlapping community structure with node influence. IEEE Access 7 (2019), 171223-171234.   DOI:10.1109/ACCESS.2019.2955161