Evaluating the Representativeness in the Geographic Distribution of Twitter User Population

被引:7
|
作者
Yin, Junjun [1 ]
Chi, Guangqing [2 ]
Van Hook, Jennifer [3 ]
机构
[1] Penn State Univ, Social Sci Res Inst, State Coll, PA 16801 USA
[2] Penn State Univ, Dept Agr Econ Sociol & Educ, State Coll, PA USA
[3] Penn State Univ, Dept Sociol & Criminol, State Coll, PA USA
来源
PROCEEDINGS OF THE 12TH WORKSHOP ON GEOGRAPHIC INFORMATION RETRIEVAL (GIR'18) | 2018年
关键词
Geo-tagged Tweets; Demographics; Bias; Representativeness; Geographic Distribution;
D O I
10.1145/3281354.3281360
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Twitter data are becoming a Big Data stream and have drawn multidisciplinary interests to study population characteristics and social problems that cannot be measured well by traditional surveys. However, the use of Twitter data has been strongly resisted because of concerns about the representativeness of the population as we know little about the demographic characters of the users. It is critical to evaluate the extent to which Twitter users represent the population across different demographic groups. This study evaluates the representativeness and examines the geographic distributions of Twitter user population and its correspondence to the real population. By estimating Twitter user demographics for the contiguous U.S. in 2014, the preliminary results revealed both over- and under-representation of certain demographic groups against the real population at county-level. A representation index is used to assess the representativeness of Twitter samples geographically, which may help further studies to identify the determinants of biases.
引用
收藏
页数:2
相关论文
共 50 条
  • [31] Population sociodemographic and geographic factors associated with dermatologist distribution in the US
    Gotschall, J. W.
    Genderson, D.
    Fitzsimmons, R.
    Wiebe, D.
    Shin, D. B.
    Takeshita, J.
    JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2023, 143 (05) : S224 - S224
  • [32] Psychosocial Care Centers for Children and Adolescents in Brazil: geographic distribution and user profile
    Garcia, Grey Yuliet Ceballos
    Santos, Darci Neves
    Machado, Daiane Borges
    CADERNOS DE SAUDE PUBLICA, 2015, 31 (12): : 2649 - 2654
  • [33] Barrai’s parameters for the Kirov oblast population and their geographic distribution
    G. I. El’chinova
    O. A. Poriadina
    I. G. Terekhovskaya
    A. A. Osetrova
    V. V. Kadyshev
    R. A. Zinchenko
    Russian Journal of Genetics, 2010, 46 : 625 - 629
  • [34] GEOGRAPHIC REGULARITIES IN MICROBE POPULATION (HETEROTROPH) DISTRIBUTION IN THE WORLD OCEAN
    KRISS, AE
    ABYZOV, SS
    LEBEDEVA, MN
    MISHUSTINA, IE
    MITSKEVICH, IN
    JOURNAL OF BACTERIOLOGY, 1960, 80 (06) : 731 - 736
  • [35] Some geographic influences in the settlement of Michigan and in the distribution of its population
    Genthe, M. K.
    PETERMANNS MITTEILUNGEN, 1914, 60 : 98 - 98
  • [36] Dentist to population ratio and geographic distribution of dentists in Iran in 2019
    Afsahi, Mahmoud
    Haghdoost, Ali Akbar
    Houshmand, Behzad
    Dehghani, Mahmoudreza
    Amanpour, Sara
    JOURNAL OF ORAL HEALTH AND ORAL EPIDEMIOLOGY, 2021, 10 (02): : 72 - 80
  • [37] SOME GEOGRAPHIC INFLUENCES IN THE SETTLEMENT OF MICHIGAN AND IN THE DISTRIBUTION OF ITS POPULATION
    Miller, George J.
    BULLETIN OF THE AMERICAN GEOGRAPHICAL SOCIETY OF NEW YORK, 1913, 45 (05): : 321 - 348
  • [38] Evaluating geo-located Twitter data as a control layer for areal interpolation of population
    Lin, Jie
    Cromley, Robert G.
    APPLIED GEOGRAPHY, 2015, 58 : 41 - 47
  • [39] Evaluating the Geographic in GIS
    Guan, Weihe W.
    Wilson, Matthew W.
    Knowles, Anne K.
    GEOGRAPHICAL REVIEW, 2019, 109 (03) : 297 - 307
  • [40] EVALUATING GEOGRAPHIC LEARNING
    KURFMAN, DG
    NATIONAL COUNCIL FOR THE SOCIAL STUDIES-YEARBOOK, 1970, (40): : 355 - 377