To read this content please select one of the options below:

A hybrid approach for predicting missing follower–followee links in social networks using topological features with ensemble learning

Riju Bhattacharya (Department of Computer Science & Engineering, National Institute of Technology Raipur, Raipur, India)
Naresh Kumar Nagwani (Department of Computer Science & Engineering, National Institute of Technology Raipur, Raipur, India)
Sarsij Tripathi (Department of Computer Science & Engineering, Motilal Nehru National Institute of Technology, Allahabad, India)

Data Technologies and Applications

ISSN: 2514-9288

Article publication date: 9 July 2022

Issue publication date: 17 March 2023

137

Abstract

Purpose

Social networking platforms are increasingly using the Follower Link Prediction tool in an effort to expand the number of their users. It facilitates the discovery of previously unidentified individuals and can be employed to determine the relationships among the nodes in a social network. On the other hand, social site firms use follower–followee link prediction (FFLP) to increase their user base. FFLP can help identify unfamiliar people and determine node-to-node links in a social network. Choosing the appropriate person to follow becomes crucial as the number of users increases. A hybrid model employing the Ensemble Learning algorithm for FFLP (HMELA) is proposed to advise the formation of new follower links in large networks.

Design/methodology/approach

HMELA includes fundamental classification techniques for treating link prediction as a binary classification problem. The data sets are represented using a variety of machine-learning-friendly hybrid graph features. The HMELA is evaluated using six real-world social network data sets.

Findings

The first set of experiments used exploratory data analysis on a di-graph to produce a balanced matrix. The second set of experiments compared the benchmark and hybrid features on data sets. This was followed by using benchmark classifiers and ensemble learning methods. The experiments show that the proposed (HMELA) method predicts missing links better than other methods.

Practical implications

A hybrid suggested model for link prediction is proposed in this paper. The suggested HMELA model makes use of AUC scores to predict new future links. The proposed approach facilitates comprehension and insight into the domain of link prediction. This work is almost entirely aimed at academics, practitioners, and those involved in the field of social networks, etc. Also, the model is quite effective in the field of product recommendation and in recommending a new friend and user on social networks.

Originality/value

The outcome on six benchmark data sets revealed that when the HMELA strategy had been applied to all of the selected data sets, the area under the curve (AUC) scores were greater than when individual techniques were applied to the same data sets. Using the HMELA technique, the maximum AUC score in the Facebook data set has been increased by 10.3 per cent from 0.8449 to 0.9479. There has also been an 8.53 per cent increase in the accuracy of the Net Science, Karate Club and USAir databases. As a result, the HMELA strategy outperforms every other strategy tested in the study.

Keywords

Citation

Bhattacharya, R., Nagwani, N.K. and Tripathi, S. (2023), "A hybrid approach for predicting missing follower–followee links in social networks using topological features with ensemble learning", Data Technologies and Applications, Vol. 57 No. 1, pp. 131-153. https://doi.org/10.1108/DTA-02-2022-0072

Publisher

:

Emerald Publishing Limited

Copyright © 2022, Emerald Publishing Limited

Related articles