2025 Volume 34 Issue 3

Chang Su, Xu Na, Fang Zhou, Linyuan Lü. GPIC: A GPU-based parallel independent cascade algorithm in complex networks[J]. Chinese Physics B, 2025, 34(3): 030204. doi: 10.1088/1674-1056/adb67c

GPIC: A GPU-based parallel independent cascade algorithm in complex networks

  • Received Date: 27/11/2024
    Accepted Date: 10/02/2025
    Available Online: 01/03/2025

Figures(9)  /  Tables(3)


Abstract: Independent cascade (IC) models, which simulate how one node can activate another, are important tools for studying the dynamics of information spreading in complex networks. However, traditional implementations of the IC model face significant efficiency bottlenecks when dealing with large-scale networks and multi-round simulations. To address this problem, this study introduces a GPU-based parallel independent cascade (GPIC) algorithm, featuring an optimized representation of the network data structure and parallel task scheduling strategies. Specifically, we propose a network data structure tailored for GPU processing, enhancing the computational efficiency and scalability of the IC model, and we design a parallel framework that exploits the full potential of the GPU's parallel processing capabilities. The results from our simulation experiments demonstrate that GPIC not only preserves accuracy but also significantly boosts efficiency, achieving a speedup factor of 129 compared with the baseline IC method. Our experiments also reveal that when using GPIC for independent cascade simulation, 100–200 simulation rounds are sufficient for studies with higher simulation costs, while high-precision studies benefit from 500 rounds to ensure reliable results, providing empirical guidance for applying this new algorithm in practical research.

1.   Introduction
  • With the rise of social media,[1,2] the study of information dissemination in online social networks[3] has become a core issue. The independent cascade (IC) model,[4,5] with its strong ability to describe information dissemination in networks, has attracted much attention. It can help identify vital influencers[6–8] and optimize information dissemination strategies.[9–11] However, the computational complexity of the IC model increases dramatically with network size, limiting its application in social networks. For one thing, social network experiments often require real-time feedback and dynamic updates, which is beyond the computational ability of traditional algorithms. For another, information dissemination research usually considers multiple parameters, leading to a significant computational burden that can slow down research progress. For example, when Goldenberg used the IC model to simulate word-of-mouth propagation,[4] he set 5 parameters, each with 7 values, requiring a total of 7^5 (16807) information dissemination scenarios to be simulated. If the research were expanded to N nodes, with 100 rounds per scenario, it would require N × 16807 × 100 simulations to calculate the information dissemination effect of each individual, which would take more than twenty years of computation on a single machine.

    To overcome this challenge, researchers have in recent years begun to explore parallel computing technologies to improve the scalability and efficiency of simulation algorithms. The graphics processing unit (GPU),[12] distinguished by its pronounced parallel processing capabilities and cost-effectiveness, has emerged as a formidable instrument for expediting extensive computational tasks. For instance, in 2012, AlexNet[13] leveraged GPUs to enhance training speed, surpassing the capabilities of CPU-based model training and achieving significant success in computer vision. Since then, GPU-based acceleration algorithms have become a foundational tool across domains such as artificial intelligence, machine learning, and physics.[14–16] However, in the field of information dissemination, there is as yet no in-depth research[17–19] on effectively mapping the IC model to the GPU architecture.[20–22] Achieving this requires a meticulous approach to algorithmic design, ensuring that the full spectrum of the GPU's parallel processing prowess is harnessed.

    In this study, we propose a GPU-based parallel independent cascade (GPIC) algorithm to accelerate the simulation process, which is more efficient and less time-consuming compared with previous IC algorithms. Specifically, the algorithm improves the efficiency and extensibility of the IC model through optimized network data structure and parallel task scheduling strategies.

    The main contributions of this paper include: (i) proposing and implementing a parallel algorithm for the IC model tailored for GPU processing; (ii) designing a network data structure and efficient memory access patterns, as well as decomposing and paralleling tasks to enhance the efficiency of the IC model; (iii) validating the accuracy and efficiency of the algorithm through extensive experiments; (iv) providing application examples of the algorithm in actual information dissemination simulations, and demonstrating its potential in practical research problems.

    This efficient GPIC algorithm can be used as a fundamental tool to simulate information spreading and has the potential to promote research on information dynamics in complex networks.

    The paper is organized as follows: Section 2 involves the method related to the GPIC algorithm. Section 3 introduces the detailed designs of GPIC, including the new data structure, parallel simulation framework, and optimization strategies. Section 4 evaluates the accuracy and efficiency of GPIC. Section 5 applies GPIC in Monte Carlo simulation. In Section 6, we conclude the study by summarizing our findings, discussing their implications, and suggesting future research directions.

2.   Method
  • The independent cascade (IC) model,[4,5] one of the information dissemination models, was first proposed by Goldenberg et al. As a probabilistic model, it assumes that the success of node u's attempt to activate its adjacent node v is an event with probability p(u,v). Moreover, the probability of an inactive node being activated by a newly active neighbor is independent of the activation attempts of previous neighbors. In addition, the model assumes that node u has only one opportunity to activate its neighbor node v, regardless of success or failure. After this attempt, although u itself remains active, it no longer has the ability to influence others; such nodes are called active nodes without influence. The activation process of the IC model is as follows.

    1. Define the initial set of active nodes A.

    2. At time t, the newly activated node u influences its adjacent node v with a success probability of p(u,v). If v has multiple neighbors that have just been activated, these nodes will attempt to activate node v in any order.

    3. If node v is successfully activated, at time t + 1, node v becomes active and will influence its adjacent inactive nodes; otherwise, the state of node v does not change at time t + 1.

    4. This process continues until there is no more influential active node in the network and the spreading process ends.
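    The four steps above can be sketched as a minimal baseline implementation in Python (an illustrative sketch, assuming the network is given as a neighbor dictionary and a uniform activation probability p; the function and variable names are ours, not the paper's):

```python
import random

def independent_cascade(adj, seeds, p=0.1, rng=None):
    """One run of the IC model.

    adj   -- dict mapping each node to a list of its neighbors
    seeds -- iterable of initially active nodes (step 1)
    p     -- uniform activation probability p(u, v)
    Returns the set of all nodes activated during the cascade.
    """
    rng = rng or random.Random()
    active = set(seeds)          # all nodes ever activated
    frontier = list(seeds)       # nodes activated in the previous step
    while frontier:              # step 4: stop when no influential active node remains
        new_frontier = []
        for u in frontier:       # step 2: each newly active node gets one attempt per neighbor
            for v in adj.get(u, ()):
                if v not in active and rng.random() < p:
                    active.add(v)            # step 3: v becomes active at time t + 1
                    new_frontier.append(v)
        frontier = new_frontier  # u stays active but no longer spreads influence
    return active
```

Because each frontier is snapshotted before the next step begins, a node activated at time t only begins to influence its neighbors at time t + 1, matching step 3.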

    In the later part of this paper, we will take this IC model implemented with Python as the baseline algorithm and compare it with GPIC to examine the performance of the new algorithm.

  • GPU has emerged as the quintessential option for parallel computation, primarily attributed to its distinctive hardware configuration and innovative design principles. Compared with CPU, GPU has more processing cores that can execute tens of thousands of threads simultaneously, providing a hardware foundation for large-scale parallel processing. Its architecture mainly includes:

    Streaming multiprocessor (SM) is a processing unit in the GPU, which includes a set of processing cores, registers, shared memory and other supporting structures. Each SM is endowed with the autonomy to concurrently manage and execute a cohort of threads, taking charge of the orchestration and operational execution of the threads it oversees.

    Streaming processor (SP) is the execution unit within SM. An SM contains multiple SPs, which are the most basic processing units for executing computational tasks.

    Compute unified device architecture (CUDA) is a parallel computing platform and programming model proposed by NVIDIA. Developers can directly use NVIDIA’s GPUs for general computing based on the CUDA API. CUDA’s parallel computing model adopts a hierarchical management method to effectively organize and schedule threads through the combination of grid, block and thread to achieve highly flexible parallel processing. In terms of memory access, CUDA provides operation support for multiple memory types, including registers, local memory, shared memory and global memory. This design allows developers to perform targeted classification processing based on the performance requirements of the data and optimize the data access efficiency during the calculation process. In terms of computation, it is achieved through kernel functions. Developers can optimize the kernel function to ensure that both SM and SP participate in the calculation as much as possible, thereby improving the overall computing efficiency.

  • This study compares the efficiency of the proposed GPIC algorithm with the baseline algorithm using two different types of artificial networks: Barabási–Albert (BA) networks[23] and Erdős–Rényi (ER) networks.[24] Table 1 presents the parameters of the BA and ER networks, where 〈k〉 is the average degree,[2] k* is the maximum degree, and c is the clustering coefficient.[25] The power-law exponent of the BA networks is set to 2. The specific details are as follows.

    We also use the Watts–Strogatz (WS) network[26] to examine our algorithm. The WS network has a higher clustering coefficient, while its average degree and maximum degree are similar to those of the ER network, as is the acceleration effect.

3.   Design of GPIC
  • This section provides a detailed introduction to the GPIC algorithm. As each active node activates its adjacent nodes independently in the independent cascade model, this activating process can be computed in parallel. And for each simulation, we can also leverage the parallel capabilities of the GPU to accelerate computation.

    The algorithm is designed to implement the IC model for performing various network tasks.

    Task 1 In a network with N nodes, select a set of seed nodes and simulate the propagation effect for R rounds. This type of task is commonly used in research such as influence simulation[27] and influence maximization.[21,28]

    Task 2 In a network with N nodes, select a set of seed nodes, immunize some of them, and then compare the propagation effects when selecting different nodes to immunize; repeat the simulation for R rounds. This type of task is commonly used in research such as information dissemination intervention simulation,[29,30] network disintegration[31,32] and network attack.[33]

    Task 3 In a network with N nodes, select each node as a seed node and assess its individual propagation effect; repeat the simulation for R rounds. This type of task is commonly used in research such as assessing the influence of individual nodes[34,35] and mining important nodes.

    Note that the activation probability P in the propagation model can be a fixed value for homogeneous propagation or vary with edge weights for heterogeneous propagation.

    The algorithm mainly consists of three parts: data structure, parallel simulation, and optimization strategies.

  • Data preprocessing is a fundamental step that transforms raw data into a format suitable for the algorithm. Raw network data is generally given as an edge list composed of node pairs or weighted node pairs. To adapt to the data structure in the GPU and facilitate indexing, we represent the network as an adjacency list that stores the list length first: [〈list length, node1, node2, …〉]. Compared with the adjacency matrix, which occupies N × N of graphics card memory, the adjacency list occupies only N × max(degree). For random networks, where the maximum degree is usually low, the space usage is therefore only 0.2%–1.7% of that of the adjacency matrix, saving a significant amount of graphics memory. However, for scale-free networks with high-degree hub nodes, this representation saves only 2%–7% of the space relative to the adjacency matrix.
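    A minimal sketch of this padded adjacency-list representation (assuming an unweighted directed edge list; the function name is illustrative):

```python
import numpy as np

def build_padded_adjacency(edges, num_nodes):
    """Padded adjacency list: row i = [list length, neighbor1, neighbor2, ...].

    Occupies N x (1 + max_degree) entries instead of the N x N adjacency matrix.
    """
    neigh = [[] for _ in range(num_nodes)]
    for u, v in edges:
        neigh[u].append(v)
    max_deg = max(len(n) for n in neigh)
    table = np.zeros((num_nodes, 1 + max_deg), dtype=np.int64)
    for u, ns in enumerate(neigh):
        table[u, 0] = len(ns)             # the list length comes first, giving O(1) iteration bounds
        table[u, 1:1 + len(ns)] = ns
    return table
```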

    In response to this, we propose another data structure based on the edge list. We store the network data as an edge list of the form [〈node1, node2, weight〉], which occupies O(3E) space, and then construct an index structure [〈node id, index start number, edge count〉] to facilitate indexing of the network data, which occupies O(3N) space. For example, for the data used in this study, E ≈ 4N, so the total space is 15N, only 0.1%–1.1% of the space occupied by the adjacency matrix. For social networks, where the network is relatively sparse, this second method is often better than the first. However, as the number of edges increases, the first method may become better. For example, in a complete graph, the first method occupies only N² − N space, while the second occupies (3/2)N² + (3/2)N space.
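    The edge list plus index structure can be sketched as follows (an illustrative construction, assuming a directed weighted edge list; the function names are ours):

```python
import numpy as np

def build_edge_index(edges, num_nodes):
    """Build the two flat arrays described above.

    edges -- list of (node1, node2, weight) triples, the O(3E) edge list
    Returns (edge_array, index_array), where row u of index_array is
    [node id, index start number, edge count], i.e. O(3N) space.
    """
    edges = sorted(edges, key=lambda e: e[0])        # group edges by source node
    edge_array = np.array(edges, dtype=np.float64)   # [node1, node2, weight]
    index_array = np.zeros((num_nodes, 3), dtype=np.int64)
    index_array[:, 0] = np.arange(num_nodes)
    for pos, (u, _, _) in enumerate(edges):
        u = int(u)
        if index_array[u, 2] == 0:
            index_array[u, 1] = pos                  # first edge belonging to u
        index_array[u, 2] += 1
    return edge_array, index_array

def neighbors(edge_array, index_array, u):
    # Looking up a node's edges is then an O(1) slice.
    start, count = index_array[u, 1], index_array[u, 2]
    return edge_array[start:start + count]
```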

    The actual memory usage is affected by the task requirements, the network structure (including network size N and maximum degree), and the number of repeated rounds R. Among these factors, the task requirements depend on the situation and R is generally fixed at around 50–1000; thus, it is important to optimize the memory usage of the network data. The measured results under the current configurations are as follows: a network with 10000 nodes occupies 14 MB of memory for Task 1, 4 MB for Task 2, and 178 MB for Task 3 (see Table 2). Most network calculations can therefore be performed on consumer-grade graphics cards.

  • The implementation of the IC model as a GPU-based parallel algorithm largely depends on the design and implementation of the kernel function. Due to constraints of graphics card memory space and its management mechanisms, the kernel function cannot freely request and release memory as on the CPU platform. Instead, it must achieve the algorithm's effects within the limited graphics card memory, ideally within the original array space without requiring additional memory allocation. To address this, we propose a method that allocates one additional temporary buffer of size P × N (P denotes the parallel number) to record the activation status of the current round. The temporary buffer uses an unsigned 8-bit integer type (uint8), which saves memory compared with wider data types such as signed integers and uint32.

    For the parallelization of Task 1, the main requirement is to perform multiple rounds of Monte Carlo simulation with a given seed set. This involves multi-threaded batch parallelism, which requires careful consideration of both the block size and the thread size. The batch size should not be too small, as much time would then be consumed in creating and destroying threads; at the same time, it should not be too large, as that would lower the degree of parallelization and fail to achieve the best acceleration effect.

    For the parallelization of Task 2, a node immunization requirement is added to the operations in Task 1. This new requirement does not fundamentally change the operations, but it increases the computational workload from R rounds of simulation in Task 1 to R × G, where G is the number of immunized node groups. This seemingly slight change results in many more comparisons between experimental groups, causing a sharp rise in computational workload and placing higher demands on computational efficiency.

    For the parallelization of Task 3, the amount of computation is N × R. Given that N (the network size) can be extremely large, the computation of this task can far exceed those of Task 1 and Task 2. The algorithm therefore adopts a different design for thread task assignment.

  • The algorithm needs to fully utilize the two-level parallelism mechanism inside the GPU, block parallelism and thread parallelism. Given that optimizing thread parallelism efficiency requires optimizing the kernel function to reasonably divide tasks, which requires in-depth analysis of task content, we choose to optimize block parallelism, which only requires choosing a reasonable block size to avoid wasting computing resources.

    Block size There are two key points for block size optimization.

    The hardware component that executes a block is the streaming multiprocessor (SM). Placing all the threads of a block on the same SM avoids cross-SM computation, effectively improving memory access speed and computational efficiency.

    Threads in the GPU are scheduled and executed in warps. A warp generally contains 32 threads, and a group of fewer than 32 threads still occupies a full warp. To avoid wasting resources, it is recommended to set the number of threads to a multiple of 32.

    Optimizing block size in this way for GPU computation may offer minimal efficiency gains for tasks like Task 1 and Task 2, which have low computational requirements. However, for high-computational tasks like Task 3, the optimization can lead to significant improvements in efficiency.

    Batch size In multi-round repeated simulations, we can divide the R rounds of simulation into B batches, i.e., R = R* × B. Although the total computational workload remains the same, dividing the task into B batches allows for greater utilization of the GPU’s available resources. This approach increases the number of active SMs and SPs, thereby enhancing computational efficiency. However, it should be noted that increasing the number of batches will require more memory for temporary results, which increases the GPU’s memory usage. Therefore, the number of batches should not exceed the GPU’s maximum memory capacity, as this would reduce computational efficiency.
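    The block and batch sizing heuristics above can be sketched as a small helper (illustrative only; the names and default values are our assumptions, not the paper's actual configuration):

```python
def launch_config(total_rounds, max_batches, warp_size=32, threads_per_block=128):
    """Pick a (batch, block) layout for R rounds of simulation.

    total_rounds      -- R
    max_batches       -- upper bound on B, set by available GPU memory
    threads_per_block -- should be a multiple of the warp size (32)
    Returns (batches, rounds_per_batch, blocks_per_batch), with R = R* x B.
    """
    assert threads_per_block % warp_size == 0, "avoid partially filled warps"
    batches = min(max_batches, total_rounds)            # B: more batches -> more active SMs/SPs
    rounds_per_batch = -(-total_rounds // batches)      # ceil(R / B), i.e. R*
    blocks = -(-rounds_per_batch // threads_per_block)  # blocks needed to cover one batch
    return batches, rounds_per_batch, blocks
```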

  • The following pseudocode presents the implementation logic of the parallel algorithm.
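    A CPU-side Python sketch mirroring this implementation logic, using the edge list plus index structure and the P × N uint8 status buffer described earlier (an illustration under our own naming, not the paper's exact kernel; on the GPU, each simulation round would map to one CUDA thread writing into its own buffer row):

```python
import numpy as np

def ic_kernel_cpu(edge_array, index_array, seeds, status, rng):
    """Mock of the per-thread kernel body for one simulation round.

    status -- one row of the P x N uint8 buffer:
              0 = inactive, 1 = newly active, 2 = active without influence
    The third column of edge_array is used as the activation probability,
    so heterogeneous propagation is supported naturally.
    """
    status[:] = 0
    status[seeds] = 1
    spread = len(seeds)
    while True:
        frontier = np.flatnonzero(status == 1)   # nodes activated in the previous step
        if frontier.size == 0:                   # no influential active node remains
            break
        for u in frontier:
            start, count = index_array[u, 1], index_array[u, 2]
            for e in range(start, start + count):
                v = int(edge_array[e, 1])
                # one activation attempt per edge, with probability p(u, v)
                if status[v] == 0 and rng.random() < edge_array[e, 2]:
                    status[v] = 1
                    spread += 1
            status[u] = 2   # u stays active but can no longer influence others
    return spread

def run_simulations(edge_array, index_array, seeds, n_nodes, rounds, seed=0):
    # On the GPU the rounds run in parallel, one per thread; here we loop serially.
    rng = np.random.default_rng(seed)
    status = np.zeros((rounds, n_nodes), dtype=np.uint8)  # the P x N uint8 buffer
    return [ic_kernel_cpu(edge_array, index_array, seeds, status[r], rng)
            for r in range(rounds)]
```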

  • GPIC offers tool functions to preprocess input data, ensuring it’s ready for algorithms. These tools can handle various network structures represented by Python libraries like networkx and igraph, simplifying the process for researchers. Researchers only need to provide network structure and propagation weights as input to simulate the independent cascade spreading process. This design allows the GPIC algorithm to seamlessly integrate with the established workflow of the traditional IC algorithm, thereby saving time during implementation and operation.

    In terms of extensibility, the logic control part of the GPIC algorithm is written in Python. It takes advantage of the characteristics of dynamically interpreted languages and can be run without compilation, making it easy to embed core simulation methods. The core logic code of the extended algorithm can follow the code structure of the Python version. By simply adding the GPU thread number and the code to save the result data, a smooth transition from CPU to GPU can be achieved. However, for spreading models with complicated logic, further design of interaction strategies between threads is required to achieve expansion, which has certain limitations.

4.   Evaluation of GPIC
  • To evaluate the performance of the GPIC algorithm, we conduct extensive experiments to compare it with the CPU-based IC algorithm. This section will first introduce the environmental setup, then the metrics for measuring performance, and finally the accuracy and efficiency results.

  • The parameters that mainly affect the computational complexity of the IC model include network size N, spreading probability P, and the number of simulation rounds R. Several cases are considered to explore how the parameters affect the computational complexity of the IC model. We design three specific testing tasks to measure the performance of the proposed GPIC algorithm.

    Task 1 Identify the top 100 nodes by degree as candidate seed nodes. Add nodes to the seed node set one by one in order of degree, increasing in size from 1 to 100. Implement each simulation for R rounds and analyze the spreading effect of different seed sizes.

    Task 2 Select the top 100 nodes by degree as seed nodes, and take the next 1%–20% of nodes by degree as the candidate immune node set. Add these candidate nodes to the immune node set one by one in order of degree, increasing the immune size from 1% to 20%. Implement each simulation for R rounds and study how the spreading size changes with the immune size.

    Task 3 Select a different node as a seed node. Make each seed node undergo R rounds of simulation and study the spreading size of a single node. This is to explore the relationship between the spreading effect of nodes and network structure and its attributes.

  • For the hardware environment, the baseline IC algorithm is run on the CPU platform, an Intel Core i9-10900K @ 3.70 GHz. The GPIC algorithm is run on GPU platforms covering the computing environments commonly used by researchers, including laptops, desktops, and servers:

    Laptop configuration: Nvidia GeForce MX150 graphics card, 384 CUDA cores, 2 GB GDDR5 memory.

    Desktop configuration: Nvidia GTX 1660SP graphics card, 1408 CUDA cores, 6 GB GDDR6 memory.

    Server configuration: Nvidia RTX 3090 graphics card, 10496 CUDA cores, 24 GB GDDR6X memory.

    For the operating environment, the CPU platform for the baseline algorithm uses Python 3.10.4. The basic operating environment of the GPU platform for GPIC is the same as that of the CPU platform, with the addition of support libraries for GPU programming, including CUDA 12.5 and numba-0.59.1.

  • We employ the following three metrics to evaluate the accuracy and efficiency of GPIC.

    Mean and variance This is to test GPIC’s accuracy. We conduct a consistency test on the results, analyzing the mean and variance of the spreading sizes produced by both the GPIC and baseline algorithm. By assessing the statistical similarity between these results, we can determine the reliability and accuracy of the GPIC algorithm in simulating spreading processes.

    Execution time This is to test GPIC’s efficiency. It refers to the time required for the algorithm to complete R rounds of IC spreading process. The time for data preparation and preprocessing is not included.

    Speedup effect This is to test GPIC’s efficiency. Take the execution time of the baseline algorithm as the baseline time t(BL), and the execution time of the GPIC algorithm as t(F), and the speedup effect is speedup = t(BL)/t(F).

  • To analyze the accuracy of the proposed GPIC algorithm, we apply the GPIC and baseline algorithm to BA and ER networks with different sizes. The three tasks mentioned above are tested, and the results are displayed in Figs. 2–4. In all the tasks, we find that the mean and variance of the spreading size obtained by the GPIC algorithm are close to those of the baseline algorithm, indicating that the proposed GPIC algorithm can guarantee the accuracy of the original independent cascade process.

    In Fig. 3, taking the simulation results of a BA network with 5000 nodes as an example, we run 1000 rounds of the spreading experiment with the GPIC algorithm and the baseline algorithm respectively. As shown in the figure, the GPIC algorithm has a mean spreading size of 1434.525 and a variance of 908.19, while the baseline algorithm has a mean of 1433.95 and a variance of 880.69. Performing a t-test on the two sets of results gives a t-value of 0.43 and a p-value of 0.67. The statistical difference between the two is thus very small, validating that there is no significant difference between the results of the GPIC and baseline algorithms. Figure 4 illustrates a slight difference in the variance of the spreading size between the GPIC and the baseline algorithm within the BA network. This is because seed node 0, which connects to many nodes, requires more rounds of simulation to reach stability.
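    This consistency check can be reproduced with a two-sample t-test. A stdlib-only sketch using Welch's variant (the paper does not specify which t-test variant was used, so this choice is an assumption):

```python
import math
from statistics import mean, variance

def welch_t(sample_a, sample_b):
    """Welch's two-sample t statistic and degrees of freedom.

    A small |t| (p-value near 1) means the GPIC and baseline spreading
    sizes are statistically indistinguishable.
    """
    na, nb = len(sample_a), len(sample_b)
    va, vb = variance(sample_a), variance(sample_b)
    se2 = va / na + vb / nb                      # squared standard error of the mean difference
    t = (mean(sample_a) - mean(sample_b)) / math.sqrt(se2)
    dof = se2 ** 2 / ((va / na) ** 2 / (na - 1) + (vb / nb) ** 2 / (nb - 1))
    return t, dof
```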

    Detailed results for the GPIC and baseline algorithms with different types of infection weights are presented in supplementary material Tables A1–A3, and the results remain consistent.

  • To analyze the acceleration effects of the proposed GPIC algorithm, we apply the GPIC and baseline algorithm to the three tasks mentioned above. As shown in Fig. 5(a), the execution time of the baseline algorithm increases proportionally with the number of simulation rounds, while the GPIC algorithm remains at a relatively low execution time regardless of the number of rounds. Furthermore, GPIC demonstrates efficiency across different network sizes, with only a slight increase in execution time as the network size grows.

    In Fig. 5(b), GPIC achieves dozens of times acceleration in the BA network. In the ER network, since the seed set size is fixed, the relative spread of the seeds decreases as the network size increases, and the overall time consumption is reduced. Even so, the time consumption of the GPIC algorithm is still much lower than that of the baseline algorithm. Furthermore, on a large-scale network, the overall execution time of GPIC drops from thousands of seconds to tens of seconds, an acceleration of two orders of magnitude.

    Figure 6 presents the time efficiency improvement of GPIC over the baseline algorithm in the three tasks. For Task 1 and Task 2, GPIC achieves hundreds of times acceleration when the network is small and tens of times when the network grows larger. For Task 3, the larger the network, the greater the acceleration effect, because the number of simulated tasks equals the number of nodes.

  • We implement the GPIC algorithm on three different platforms to examine how the hardware environment affects its performance. The platforms are equipped with an MX150, a GTX 1660SP, and an RTX 3090, respectively. Roughly comparing the three in terms of CUDA core count, memory, and bandwidth, the GTX 1660SP has approximately 2–3 times the capabilities of the MX150, and the RTX 3090 about 20–30 times the performance metrics of the MX150.

    We run 50, 100, 500, and 1000 rounds of simulation on the BA network. The experimental results show that, compared with the CPU platform, even an entry-level graphics card such as the MX150 achieves an acceleration of 2–554 times, while mid-range and high-end graphics cards achieve 21–1586 times acceleration (see supplementary material Tables A4–A6). The performance of GPIC across the MX150, GTX 1660SP, and RTX 3090 is illustrated in Fig. 7. Notably, GPIC's performance on the RTX 3090 significantly surpasses that on the MX150, while remaining comparable to that on the GTX 1660SP. This indicates that the GPIC algorithm achieves good acceleration on consumer-grade graphics cards, and that there is still large room for improvement on high-end graphics cards.

  • In recent years, with the continuous improvement of data collection capabilities, the scale of the networks studied by scholars has expanded to millions of nodes. Large-scale network data poses new challenges to simulation efficiency. The GPIC algorithm, which relies on the hardware of a single computing node, has certain limitations when implementing propagation simulations at this scale. We select four large real-world datasets[36–38] and run testing tasks of 100 repeated simulations. The results, shown in Table 3, demonstrate the computational capability of the GPIC algorithm on large-scale network datasets. Tasks 1–3 indicate the execution time of the GPIC algorithm for performing Tasks 1–3, respectively. The other headers are the same as those in Table 1.

    For Task 1 and Task 2, we increased the number of blocks to make fuller use of the GPU cores during computation, thereby enhancing computational efficiency. Task 3, however, requires an intermediate result cache of size N × N. For networks with hundreds of thousands of nodes, the required cache exceeds the RTX 3090 memory (24 GB), which is beyond the capacity of ordinary hardware. Nevertheless, the GPIC algorithm benefits from GPU shared-memory scheduling techniques and can still perform the computation using CPU memory, although the frequent transfers between CPU memory and GPU memory limit the acceleration effect. Additionally, since each task in Task 3 is independent, it can be divided into multiple batches for computation. By keeping the GPU memory required by each batch within the capacity of the GPU's dedicated memory, the computational efficiency can also be effectively improved.

5.   Application of GPIC
  • In the field of information dissemination research, the IC model is employed in Monte Carlo simulations to estimate the spread size of information, which typically requires conducting multiple rounds of repeated experiments (R times) and performing statistical analysis on the results. While it is generally recommended to conduct 100 to 200 rounds of simulation to balance cost and benefit, the exact number of repetitions required for the results to closely approximate their stable values remains unclear.

    Given the high efficiency of our GPIC algorithm, this section aims to (1) utilize the GPIC algorithm to perform a large number of repeated experiments to explore the impact of different repetition counts on the stability of simulation results; and (2) investigate the number of repetitions needed to achieve relatively stable simulation outcomes across various network scales.

  • Here, we employ multiple BA scale-free network datasets with typical social network characteristics for the Monte Carlo simulation experiment, and conduct experiments on a system equipped with a high-performance GPU, specifically the Nvidia RTX3090.

    In this experiment, when the statistical measures (such as variance and mean) of the simulation results show little fluctuation as the number of simulation rounds increases, we consider the current results to be stable and record the number of simulation rounds.

  • We construct a continuous sequence of data from the results of multiple rounds of simulation, recalculating the mean and variance of the sequence data with each additional data point, forming a continuously changing sequence of mean and variance. The trend and pattern of changes can be observed visually.
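    Such a continuously updated sequence of mean and variance can be computed with Welford's online algorithm (a sketch; the paper does not state its exact recomputation method):

```python
def running_stats(values):
    """Welford's online update: after each additional simulation round,
    yield the (mean, variance) of all rounds seen so far."""
    mean, m2, out = 0.0, 0.0, []
    for n, x in enumerate(values, start=1):
        delta = x - mean
        mean += delta / n
        m2 += delta * (x - mean)          # accumulates sum of squared deviations
        var = m2 / (n - 1) if n > 1 else 0.0
        out.append((mean, var))
    return out
```

Plotting the resulting sequences against the round number reproduces the stabilization curves of Fig. 8.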

    As shown in Fig. 8, before reaching 100 rounds, both the mean and variance exhibit significant fluctuations, indicating that the results are not yet stable. As the number of simulation rounds increases, particularly between 100 and 200 rounds, the mean and variance begin to stabilize, suggesting that the dynamic characteristics of the spreading process are diminishing. After approximately 500 rounds, both the mean and variance reach a relatively stable state, achieving full stability around 1000 rounds.

    This indicates that for studies where simulation cost is the main concern, performing 100 to 200 rounds is acceptable, while studies that demand higher precision should run about 500 rounds to ensure reliable results. This allows a balance between different research needs and resource constraints.

    Furthermore, we repeat the same experiments on ER networks and with a homogeneous propagation setting. Details are presented in the supplementary material (Figs. S1–S3), and the results remain consistent.

6.   Conclusion and perspectives
  • This study proposes a GPU-based parallel algorithm for the independent cascade model, aiming to resolve the efficiency bottleneck that traditional algorithms face when running Monte Carlo simulations with the IC model.

    By designing new network data structures and efficient memory access patterns, and by decomposing and parallelizing tasks, the proposed algorithm significantly improves the efficiency and scalability of the IC model. The main contributions of this study include:

    Algorithmic innovation: We propose a GPU-based parallel algorithm to implement the IC model, achieving high efficiency through GPUs’ high parallel processing capabilities.

    Performance improvement: We demonstrate that the algorithm improves computational efficiency over the baseline across datasets of different scales and different numbers of simulation rounds.

    Significant application value: We have determined the optimal number of simulation rounds under limited computing power, providing practical insights for related research.
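As an illustration of the kind of GPU-friendly network layout referred to above, a common choice is a CSR-style pair of flat arrays, whose contiguous storage favors coalesced memory access on a GPU. This Python sketch is illustrative only; the paper's exact data structure is not reproduced here:

```python
def to_csr(num_nodes, edges):
    """Build CSR-style flat arrays (offsets + neighbor list) from a
    directed edge list. Neighbors of node u then live in
    neighbors[offsets[u]:offsets[u+1]]."""
    degree = [0] * num_nodes
    for u, _ in edges:
        degree[u] += 1
    offsets = [0] * (num_nodes + 1)
    for i in range(num_nodes):
        offsets[i + 1] = offsets[i] + degree[i]
    neighbors = [0] * len(edges)
    cursor = offsets[:-1].copy()  # next free slot per node
    for u, v in edges:
        neighbors[cursor[u]] = v
        cursor[u] += 1
    return offsets, neighbors

offsets, neighbors = to_csr(4, [(0, 1), (0, 2), (1, 2), (2, 3)])
```

On a GPU, the two arrays can be transferred once and shared read-only by all simulation threads, avoiding pointer-chasing adjacency lists.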

    The GPIC algorithm holds extensive potential applications across various fields, extending beyond the scope of traditional information dissemination research. It can be utilized to analyze the spread of trending topics on social media, evaluate the effectiveness of advertising campaigns, and assess strategies for public opinion guidance in large-scale social networks. Additionally, it can rapidly validate the effectiveness of epidemic models in the context of epidemic spread simulation. The GPIC algorithm is also valuable for evaluating algorithms related to spreading dynamics and identifying vital nodes within networks.[39] Furthermore, its versatility allows it to be applied in interdisciplinary spreading research, such as in economics, ecosystems, and cybersecurity, where spreading dynamics play a crucial role.

    The proposed GPU-based parallel IC algorithm achieves remarkable gains in simulation efficiency, but several limitations remain to be addressed in future work.

    1) Optimizing parameter setting The current GPIC algorithm uses a fixed value for the batch size involved in task decomposition. However, this value should be adjusted according to the GPU's hardware parameters and the specific task at hand. The GPIC algorithm can therefore be extended to adaptively select better parameter values for different scenarios.

    2) Enhancing multi-GPU support The current algorithm is designed and optimized mainly for a single GPU; it can be extended to multi-machine, multi-GPU computing environments, enabling parallel simulation on even larger networks.

    3) Expanding to diverse propagation models The current algorithm is designed for the independent cascade model; it can be extended to other propagation models such as SI, SIR, and SIS,[40] thereby enabling more extensive spreading simulations.

    4) Broadening practical applications Our algorithm can be applied to other simulation scenarios, such as virus spreading simulation[41,42] and public opinion dissemination, to verify its practicality and generalization ability.
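For point 1) above, one hypothetical direction is to derive the batch size from device properties rather than fixing it. The heuristic, the function name, and the hardware figures below are illustrative assumptions, not the GPIC implementation:

```python
def pick_batch_size(num_tasks, sm_count, max_threads_per_sm, block_size=256):
    """Hypothetical heuristic: choose a batch size that keeps every
    streaming multiprocessor saturated with resident thread blocks,
    capped at the total number of tasks."""
    blocks_in_flight = sm_count * max(1, max_threads_per_sm // block_size)
    target = blocks_in_flight * block_size
    return min(num_tasks, max(block_size, target))

# e.g. RTX 3090-like figures: 82 SMs, 1536 resident threads per SM.
bs = pick_batch_size(num_tasks=1_000_000, sm_count=82, max_threads_per_sm=1536)
```

In practice such values would be queried from the driver at runtime (e.g. via CUDA device attributes) instead of being hard-coded.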

    In conclusion, this study demonstrates the feasibility of a GPU-based parallel algorithm for information-spreading simulation on social networks. It improves the efficiency of the IC model tremendously compared with the traditional algorithm. As computing technology develops and network sizes grow, GPIC will provide an important tool and reference for research in the social network field.

Program availability
  • The source code is openly available in the Science Data Bank at https://doi.org/10.57760/sciencedb.j00113.00183.

