Sampling Representative Users from Large Social Networks

Cited by: 0
Authors
Tang, Jie [1, 2]
Zhang, Chenhui [1, 2]
Cai, Keke [3]
Zhang, Li [3]
Su, Zhong [3]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] TNList, Beijing, Peoples R China
[3] IBM Corp, China Res Lab, Beijing, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Finding a subset of users that statistically represents the original social network is a fundamental issue in Social Network Analysis (SNA). The problem has not been extensively studied in the existing literature. In this paper, we present a formal definition of the problem of sampling representative users from a social network. We propose two sampling models and theoretically prove their NP-hardness. To solve the two models efficiently, we present an algorithm with provable approximation guarantees. Experimental results on two datasets show that the proposed models for sampling representative users significantly outperform (+6%-23% in terms of Precision@100) several alternative methods that use authority or structure information only. The proposed algorithms are also efficient in running time: only a few seconds are needed to sample 300 representative users from a network of 100,000 users. All data and code are publicly available.
Pages: 304-310
Page count: 7