Taming Uncertainty in Big Data: Evidence from Social Media in Urban Areas

Cited by: 11
Authors
Bendler, Johannes [1]
Wagner, Sebastian [1]
Brandt, Tobias [1]
Neumann, Dirk [1]
Affiliation
[1] Univ Freiburg, D-79098 Freiburg, Germany
Keywords
Big data; Uncertainty; Social media; Veracity; Spatio-temporal patterns; Points of interest
DOI
10.1007/s12599-014-0342-4
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
While the classic definition of Big Data included the dimensions volume, velocity, and variety, a fourth dimension, veracity, has recently come to the attention of researchers and practitioners. The increasing amount of user-generated data associated with the rise of social media emphasizes the need for methods to deal with the uncertainty inherent in these data sources. In this paper we address one aspect of uncertainty by developing a new methodology to establish the reliability of user-generated data based upon causal links with recurring patterns. We associate a large data set of geo-tagged Twitter messages in San Francisco with points of interest, such as bars, restaurants, or museums, within the city. This model is validated by causal relationships between a point of interest and the number of messages in its vicinity. We subsequently analyze the behavior of these messages over time using a jackknifing procedure to identify categories of points of interest that exhibit consistent patterns over time. Ultimately, we condense this analysis into an indicator that gives evidence on the certainty of a data set based on these causal relationships and recurring patterns in temporal and spatial dimensions.
Pages: 279-288
Page count: 10