Practical Recommendations on Crawling Online Social Networks

Cited by: 130
Authors
Gjoka, Minas [1]
Kurant, Maciej [1]
Butts, Carter T. [1,2]
Markopoulou, Athina [1,3]
Affiliations
[1] Univ Calif Irvine, Calif Inst Telecomm & Informat Technol CalIT2, Irvine, CA 92697 USA
[2] Univ Calif Irvine, Dept Sociol, Irvine, CA 92697 USA
[3] Univ Calif Irvine, Dept EECS, Irvine, CA 92697 USA
Funding
Swiss National Science Foundation; U.S. National Science Foundation
Keywords
Sampling methods; Social network services; Facebook; Random Walks; Convergence; Measurements; Graph sampling;
DOI
10.1109/JSAC.2011.111011
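Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification codes
0808; 0809
Abstract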
Our goal in this paper is to develop a practical framework for obtaining a uniform sample of users in an online social network (OSN) by crawling its social graph. Such a sample makes it possible to estimate any user property, as well as some topological properties. To this end, we first consider and compare several candidate crawling techniques. Two approaches that can produce approximately uniform samples are the Metropolis-Hastings random walk (MHRW) and a re-weighted random walk (RWRW). Both have pros and cons, which we demonstrate through a comparison to each other as well as to the "ground truth." In contrast, using Breadth-First-Search (BFS) or an unadjusted Random Walk (RW) leads to substantially biased results. Second, in addition to offline performance assessment, we introduce online formal convergence diagnostics to assess sample quality during the data collection process. We show how these diagnostics can be used to determine when a random walk sample is of adequate size and quality. Third, as a case study, we apply the above methods to Facebook and collect what is, to the best of our knowledge, the first representative sample of Facebook users. We make it publicly available and employ it to characterize several key properties of Facebook.
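The contrast the abstract draws between MHRW, RWRW, and an unadjusted RW can be illustrated with a minimal Python sketch. This is not the authors' code: the six-node toy graph, the function names (`random_walk`, `mhrw`, `reweighted_mean`), the seeds, and the step count are all assumptions chosen for illustration. The sketch assumes the standard textbook forms of both techniques: MHRW accepts a proposed move from `v` to `w` with probability min(1, deg(v)/deg(w)), and RWRW weights each node visited by a simple random walk by 1/deg(v).

```python
import random

# Hypothetical toy graph as symmetric adjacency lists (an assumption, not data
# from the paper): a hub (node 0) plus two triangles through it and one leaf.
graph = {
    0: [1, 2, 3, 4, 5],
    1: [0, 2],
    2: [0, 1],
    3: [0, 4],
    4: [0, 3],
    5: [0],
}

def degree(v):
    return len(graph[v])

def random_walk(start, steps, rng):
    """Unadjusted random walk: visits nodes proportionally to their degree."""
    node, sample = start, []
    for _ in range(steps):
        node = rng.choice(graph[node])
        sample.append(node)
    return sample

def mhrw(start, steps, rng):
    """Metropolis-Hastings random walk targeting the uniform distribution."""
    node, sample = start, []
    for _ in range(steps):
        candidate = rng.choice(graph[node])
        # Accept with probability min(1, deg(node) / deg(candidate));
        # on rejection the walk stays put and the current node is re-sampled.
        if rng.random() < degree(node) / degree(candidate):
            node = candidate
        sample.append(node)
    return sample

def reweighted_mean(sample, f):
    """RWRW estimate of the mean of f: weight each visit by 1 / degree."""
    weights = [1.0 / degree(v) for v in sample]
    return sum(w * f(v) for w, v in zip(weights, sample)) / sum(weights)

steps = 100_000
true_mean = sum(degree(v) for v in graph) / len(graph)

rw_sample = random_walk(0, steps, random.Random(1))
mhrw_sample = mhrw(0, steps, random.Random(2))

rw_naive = sum(degree(v) for v in rw_sample) / steps      # biased toward hubs
rwrw_est = reweighted_mean(rw_sample, degree)             # bias corrected
mhrw_est = sum(degree(v) for v in mhrw_sample) / steps    # uniform by design

print(f"true mean degree: {true_mean:.3f}")
print(f"unadjusted RW:    {rw_naive:.3f}")
print(f"RWRW:             {rwrw_est:.3f}")
print(f"MHRW:             {mhrw_est:.3f}")
```

On this toy graph the unadjusted walk should overestimate the mean degree because it visits the hub more often, while the RWRW and MHRW estimates should land close to the true value. A real crawl would replace the in-memory `graph` lookup with requests for each user's friend list.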
Pages: 1872-1892
Page count: 21