Taming Uncertainty in Big Data Evidence from Social Media in Urban Areas

被引:11
|
作者
Bendler, Johannes [1 ]
Wagner, Sebastian [1 ]
Brandt, Tobias [1 ]
Neumann, Dirk [1 ]
机构
[1] Univ Freiburg, D-79098 Freiburg, Germany
来源
关键词
Big data; Uncertainty; Social media; Veracity; Spatio-temporal patterns; Points of interest;
D O I
10.1007/s12599-014-0342-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While the classic definition of Big Data included the dimensions volume, velocity, and variety, a fourth dimension, veracity, has recently come to the attention of researchers and practitioners. The increasing amount of user-generated data associated with the rise of social media emphasizes the need for methods to deal with the uncertainty inherent to these data sources. In this paper we address one aspect of uncertainty by developing a new methodology to establish the reliability of user-generated data based upon causal links with recurring patterns. We associate a large data set of geo-tagged Twitter messages in San Francisco with points of interest, such as bars, restaurants, or museums, within the city. This model is validated by causal relationships between a point of interest and the amount of messages in its vicinity. We subsequently analyze the behavior of these messages over time using a jackknifing procedure to identify categories of points of interest that exhibit consistent patterns over time. Ultimately, we condense this analysis into an indicator that gives evidence on the certainty of a data set based on these causal relationships and recurring patterns in temporal and spatial dimensions.
引用
收藏
页码:279 / 288
页数:10
相关论文
共 50 条
  • [1] Taming Uncertainty in Big DataEvidence from Social Media in Urban Areas
    Johannes Bendler
    Sebastian Wagner
    Tobias Brandt
    Dirk Neumann
    Business & Information Systems Engineering, 2014, 6 : 279 - 288
  • [2] Taming Big Data: Using App Technology to Study Organizational Behavior on Social Media
    Bail, Christopher A.
    SOCIOLOGICAL METHODS & RESEARCH, 2017, 46 (02) : 189 - 217
  • [3] Flowers as attractions in urban parks: Evidence from social media data
    Mou, Naixia
    Wang, Jinhua
    Zheng, Yunhao
    Zhang, Lingxian
    Makkonen, Teemu
    Yang, Tengfei
    Niu, Jiqiang
    URBAN FORESTRY & URBAN GREENING, 2023, 82
  • [4] Compromised Data: From Social Media to Big Data
    Kent, Michael L.
    INTERNATIONAL JOURNAL OF COMMUNICATION, 2018, 12 : 2725 - 2729
  • [5] The Geography of Social Media Data in Urban Areas: Representativeness and Complementarity
    Bernabeu-Bautista, Alvaro
    Serrano-Estrada, Leticia
    Perez-Sanchez, V. Raul
    Marti, Pablo
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (11)
  • [6] Chatty maps: constructing sound maps of urban areas from social media data
    Aiello, Luca Maria
    Schifanella, Rossano
    Quercia, Daniele
    Aletta, Francesco
    ROYAL SOCIETY OPEN SCIENCE, 2016, 3 (03):
  • [7] Social Media Meets Big Urban Data: A Case Study of Urban Waterlogging Analysis
    Zhang, Ningyu
    Chen, Huajun
    Chen, Jiaoyan
    Chen, Xi
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
  • [8] Using social media data to map urban areas: ideas and limits
    Miao, Z.
    Iannelli, G. C.
    Gamba, P.
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 5800 - 5803
  • [9] Situation monitoring of urban areas using social media data streams
    Weiler, Andreas
    Grossniklaus, Michael
    Scholl, Marc H.
    INFORMATION SYSTEMS, 2016, 57 : 129 - 141
  • [10] Public Concern and Awareness of National Parks in China: Evidence from Social Media Big Data and Questionnaire Data
    Dou, Yaquan
    Wu, Changhao
    He, Youjun
    SUSTAINABILITY, 2023, 15 (03)