Influence Maximization: Near-Optimal Time Complexity Meets Practical Efficiency

被引:563
|
作者
Tang, Youze [1 ]
Xiao, Xiaokui [1 ]
Shi, Yanchen [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
关键词
Algorithms; Theory; Experimentation;
D O I
10.1145/2588555.2593670
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given a social network G and a constant k, the influence maximization problem asks for k nodes in G that (directly and indirectly) influence the largest number of nodes under a pre-defined diffusion model. This problem finds important applications in viral marketing, and has been extensively studied in the literature. Existing algorithms for influence maximization, however, either trade approximation guarantees for practical efficiency, or vice versa. In particular, among the algorithms that achieve constant factor approximations under the prominent independent cascade (IC) model or linear threshold (LT) model, none can handle a million-node graph without incurring prohibitive overheads. This paper presents TIM, an algorithm that aims to bridge the theory and practice in influence maximization. On the theory side, we show that TIM runs in O((k + l)(n + m) log n/epsilon(2)) expected time and returns a (1 - 1/e - epsilon)-approximate solution with at least 1 - n(-l) probability. The time complexity of TIM is near-optimal under the IC model, as it is only a log n factor larger than the Omega(m + n) lower-bound established in previous work (for fixed k, l, and epsilon). Moreover, TIM supports the triggering model, which is a general diffusion model that includes both IC and LT as special cases. On the practice side, TIM incorporates novel heuristics that significantly improve its empirical efficiency without compromising its asymptotic performance. We experimentally evaluate TIM with the largest datasets ever tested in the literature, and show that it outperforms the state-of-the-art solutions (with approximation guarantees) by up to four orders of magnitude in terms of running time. In particular, when k = 50, epsilon = 0.2, and l = 1, TIM requires less than one hour on a commodity machine to process a network with 41.6 million nodes and 1.4 billion edges. This demonstrates that influence maximization algorithms can be made practical while still offering strong theoretical guarantees.
引用
收藏
页码:75 / 86
页数:12
相关论文
共 50 条
  • [21] Connected Components in LinearWork and Near-Optimal Time
    Farhadi, Alireza
    Liu, Sixue Cliff
    Shi, Elaine
    PROCEEDINGS OF THE 36TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, SPAA 2024, 2024, : 395 - 402
  • [22] Near-Optimal MIMO Detection Algorithm with Low and Fixed Complexity
    Kim, Hyunsub
    Kim, Jaeseok
    2015 IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS (ISCE), 2015,
  • [23] NEAR-OPTIMAL SAMPLE COMPLEXITY BOUNDS FOR CIRCULANT BINARY EMBEDDING
    Oymak, Samet
    Thrampoulidis, Christos
    Hassibi, Babak
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 6359 - 6363
  • [24] Complexity of near-optimal robust versions of multilevel optimization problems
    Besancon, Mathieu
    Anjos, Miguel F.
    Brotcorne, Luce
    OPTIMIZATION LETTERS, 2021, 15 (08) : 2597 - 2610
  • [25] Solving Discounted Stochastic Two-Player Games with Near-Optimal Time and Sample Complexity
    Sidford, Aaron
    Wang, Mengdi
    Yang, Lin F.
    Ye, Yinyu
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
  • [26] Near-Optimal Search Time in δ-Optimal Space, and Vice Versa
    Kociumaka, Tomasz
    Navarro, Gonzalo
    Olivares, Francisco
    ALGORITHMICA, 2024, 86 (04) : 1031 - 1056
  • [27] Composable Core-sets for Determinant Maximization: A Simple Near-Optimal Algorithm
    Indyk, Piotr
    Mahabadi, Sepideh
    Gharan, Shayan Oveis
    Rezaei, Alireza
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [28] Truthful and Near-Optimal Mechanisms for Welfare Maximization in Multi-Winner Elections
    Bhaskar, Umang
    Dani, Varsha
    Ghosh, Abheek
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 924 - 931
  • [29] Arikan Meets Shannon: Polar Codes with Near-Optimal Convergence to Channel Capacity
    Guruswami, Venkatesan
    Riazanov, Andrii
    Ye, Min
    PROCEEDINGS OF THE 52ND ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '20), 2020, : 552 - 564
  • [30] Achieving Optimal Throughput and Near-Optimal Asymptotic Delay Performance in Multichannel Wireless Networks With Low Complexity: A Practical Greedy Scheduling Policy
    Ji, Bo
    Gupta, Gagan R.
    Sharma, Manu
    Lin, Xiaojun
    Shroff, Ness B.
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2015, 23 (03) : 880 - 893