Evaluating the Representativeness in the Geographic Distribution of Twitter User Population

被引:7
|
作者
Yin, Junjun [1 ]
Chi, Guangqing [2 ]
Van Hook, Jennifer [3 ]
机构
[1] Penn State Univ, Social Sci Res Inst, State Coll, PA 16801 USA
[2] Penn State Univ, Dept Agr Econ Sociol & Educ, State Coll, PA USA
[3] Penn State Univ, Dept Sociol & Criminol, State Coll, PA USA
来源
PROCEEDINGS OF THE 12TH WORKSHOP ON GEOGRAPHIC INFORMATION RETRIEVAL (GIR'18) | 2018年
关键词
Geo-tagged Tweets; Demographics; Bias; Representativeness; Geographic Distribution;
D O I
10.1145/3281354.3281360
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Twitter data are becoming a Big Data stream and have drawn multidisciplinary interests to study population characteristics and social problems that cannot be measured well by traditional surveys. However, the use of Twitter data has been strongly resisted because of concerns about the representativeness of the population as we know little about the demographic characters of the users. It is critical to evaluate the extent to which Twitter users represent the population across different demographic groups. This study evaluates the representativeness and examines the geographic distributions of Twitter user population and its correspondence to the real population. By estimating Twitter user demographics for the contiguous U.S. in 2014, the preliminary results revealed both over- and under-representation of certain demographic groups against the real population at county-level. A representation index is used to assess the representativeness of Twitter samples geographically, which may help further studies to identify the determinants of biases.
引用
收藏
页数:2
相关论文
共 50 条
  • [21] Detecting TV Program Highlight Scenes Using Twitter Data Classified by Twitter User Behavior and Evaluating It to Soccer Game TV Programs
    Hayama, Tessai
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (04) : 917 - 924
  • [22] Interpreting Twitter User Geolocation
    Zhong, Ting
    Wang, Tianliang
    Zhou, Fan
    Trajcevski, Goce
    Zhang, Kunpeng
    Yang, Yi
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 853 - 859
  • [23] Monitoring User Evolution in Twitter
    Lauschke, Claudia
    Ntoutsi, Eirini
    2012 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2012, : 972 - 977
  • [24] FACTUS: Faceted Twitter User Search Using Twitter Lists
    Komamizu, Takahiro
    Yamaguchi, Yuto
    Amagasa, Toshiyuki
    Kitagawa, Hiroyuki
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2011, 2011, 6997 : 343 - 344
  • [25] Representativeness of Abortion Legislation Debate on Twitter: A Case Study in Argentina and Chile
    Graells-Garrido, Eduardo
    Baeza-Yates, Ricardo
    Lalmas, Mounia
    WWW'20: COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2020, 2020, : 765 - 774
  • [26] RECRUITMENT STRATEGIES AND GEOGRAPHIC REPRESENTATIVENESS FOR PATIENT SURVEYS IN RARE DISEASES
    Yu, J.
    Paranagama, D.
    Parasuraman, S.
    VALUE IN HEALTH, 2017, 20 (05) : A338 - A338
  • [27] Barrai's Parameters for the Kirov Oblast Population and Their Geographic Distribution
    El'chinova, G. I.
    Poriadina, O. A.
    Terekhovskaya, I. G.
    Osetrova, A. A.
    Kadyshev, V. V.
    Zinchenko, R. A.
    RUSSIAN JOURNAL OF GENETICS, 2010, 46 (05) : 625 - 629
  • [28] Molecular Characterization and Geographic Distribution of a Mymonavirus in the Population of Botrytis cinerea
    Hao, Fangmin
    Wu, Mingde
    Li, Guoqing
    VIRUSES-BASEL, 2018, 10 (08):
  • [29] GEOGRAPHIC-DISTRIBUTION OF FRANCES POPULATION AND REGIONAL-DEVELOPMENT
    PERRIN, N
    POPULATION, 1956, 11 (04): : 701 - 724
  • [30] The Geographic Distribution of Pediatric Anesthesiologists Relative to the US Pediatric Population
    Muffly, Matthew K.
    Medeiros, David
    Muffly, Tyler M.
    Singleton, Mark A.
    Honkanen, Anita
    ANESTHESIA AND ANALGESIA, 2017, 125 (01): : 261 - 267