Taming Uncertainty in Big Data Evidence from Social Media in Urban Areas

被引:11
|
作者
Bendler, Johannes [1 ]
Wagner, Sebastian [1 ]
Brandt, Tobias [1 ]
Neumann, Dirk [1 ]
机构
[1] Univ Freiburg, D-79098 Freiburg, Germany
来源
关键词
Big data; Uncertainty; Social media; Veracity; Spatio-temporal patterns; Points of interest;
D O I
10.1007/s12599-014-0342-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While the classic definition of Big Data included the dimensions volume, velocity, and variety, a fourth dimension, veracity, has recently come to the attention of researchers and practitioners. The increasing amount of user-generated data associated with the rise of social media emphasizes the need for methods to deal with the uncertainty inherent to these data sources. In this paper we address one aspect of uncertainty by developing a new methodology to establish the reliability of user-generated data based upon causal links with recurring patterns. We associate a large data set of geo-tagged Twitter messages in San Francisco with points of interest, such as bars, restaurants, or museums, within the city. This model is validated by causal relationships between a point of interest and the amount of messages in its vicinity. We subsequently analyze the behavior of these messages over time using a jackknifing procedure to identify categories of points of interest that exhibit consistent patterns over time. Ultimately, we condense this analysis into an indicator that gives evidence on the certainty of a data set based on these causal relationships and recurring patterns in temporal and spatial dimensions.
引用
收藏
页码:279 / 288
页数:10
相关论文
共 50 条
  • [21] A Text Mining Analysis on Big Data Extracted from Social Media
    Schoier, Gabriella
    Borruso, Giuseppe
    Tossut, Pietro
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2020, PART IV, 2020, 12252 : 351 - 364
  • [22] Public perspective on renewable and other energy resources: Evidence from social media big data and sentiment analysis
    Jeong, Dahye
    Hwang, Syjung
    Kim, Jisu
    Yu, Hyerim
    Park, Eunil
    ENERGY STRATEGY REVIEWS, 2023, 50
  • [23] The Impact of Digitization on Urban Social-Ecological Resilience: Evidence from Big Data Policy Pilots in China
    Zhou, Yucen
    Wang, Zhong
    Liu, Lifeng
    Peng, Yanran
    Ihimbazwe, Beatrice
    SUSTAINABILITY, 2025, 17 (02)
  • [24] Social media big data analytics: A survey
    Ghani, Norjihan Abdul
    Hamid, Suraya
    Hashem, Ibrahim Abaker Targio
    Ahmed, Ejaz
    COMPUTERS IN HUMAN BEHAVIOR, 2019, 101 : 417 - 428
  • [25] Social Media Analytics Based on Big Data
    Shaikh, Farzana
    Rangrez, Firdaus
    Khan, Afsha
    Shaikh, Uzma
    PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL (I2C2), 2017,
  • [26] A taxonomy and survey of big data in social media
    Hemmati, Atefeh
    Arzanagh, Hanieh Mohammadi
    Rahmani, Amir Masoud
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (01):
  • [27] Big data transforms the interpretation of the social media
    Martinez-Martinez, Silvia
    Lara-Navarra, Pablo
    PROFESIONAL DE LA INFORMACION, 2014, 23 (06): : 575 - 581
  • [28] Challenging Citizenship: Social Media and Big Data
    Mirko Tobias Schäfer
    Computer Supported Cooperative Work (CSCW), 2016, 25 : 111 - 113
  • [29] Big Data Privacy in Social Media Sites
    Shozi, Nobubele Angel
    Mtsweni, Jabu
    2017 IST-AFRICA WEEK CONFERENCE (IST-AFRICA), 2017,
  • [30] Challenging Citizenship: Social Media and Big Data
    Schafer, Mirko Tobias
    COMPUTER SUPPORTED COOPERATIVE WORK-THE JOURNAL OF COLLABORATIVE COMPUTING, 2016, 25 (2-3): : 111 - 113