Taming Uncertainty in Big Data Evidence from Social Media in Urban Areas

Cited by: 11
Authors
Bendler, Johannes [1]
Wagner, Sebastian [1]
Brandt, Tobias [1]
Neumann, Dirk [1]
Affiliations
[1] Univ Freiburg, D-79098 Freiburg, Germany
Source
Keywords
Big data; Uncertainty; Social media; Veracity; Spatio-temporal patterns; Points of interest
DOI
10.1007/s12599-014-0342-4
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
While the classic definition of Big Data included the dimensions volume, velocity, and variety, a fourth dimension, veracity, has recently come to the attention of researchers and practitioners. The increasing amount of user-generated data associated with the rise of social media emphasizes the need for methods to deal with the uncertainty inherent in these data sources. In this paper we address one aspect of uncertainty by developing a new methodology to establish the reliability of user-generated data based upon causal links with recurring patterns. We associate a large data set of geo-tagged Twitter messages in San Francisco with points of interest, such as bars, restaurants, or museums, within the city. This model is validated by causal relationships between a point of interest and the number of messages in its vicinity. We subsequently analyze the behavior of these messages over time using a jackknifing procedure to identify categories of points of interest that exhibit consistent patterns over time. Ultimately, we condense this analysis into an indicator that gives evidence on the certainty of a data set based on these causal relationships and recurring patterns in temporal and spatial dimensions.
Pages: 279-288
Page count: 10
Related papers
50 in total
  • [31] Big data & politics: From stories to data. Persuade in the age of social media
    Hendi, Carolina Ines Regalia
    TEKNOKULTURA: REVISTA DE CULTURA DIGITAL Y MOVIMIENTOS SOCIALES, 2023, 20 (02): 285-287
  • [32] Making Big Data Small: Strategies to Expand Urban and Geographical Research Using Social Media
    Poorthuis, Ate
    Zook, Matthew
    JOURNAL OF URBAN TECHNOLOGY, 2017, 24 (04): 115-135
  • [33] Eliciting users' preferences and values in urban parks: Evidence from analyzing social media data from Hong Kong
    Wan, Calvin
    Shen, Geoffrey Qiping
    Choi, Stella
    URBAN FORESTRY & URBAN GREENING, 2021, 62
  • [34] Evaluating the Quality of Social Media Data in Big Data Architecture
    Immonen, Anne
    Paakkonen, Pekka
    Ovaska, Eila
    IEEE ACCESS, 2015, 3: 2028-2043
  • [35] A SURVEY ON BIG DATA ANALYTICS USING SOCIAL MEDIA DATA
    Paul, P. Victer
    Monica, K.
    Trishanka, M.
    2017 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2017
  • [36] A Survey of Social Media, Big Data, Data Mining, and Analytics
    Oliverio, Jared
    JOURNAL OF INDUSTRIAL INTEGRATION AND MANAGEMENT-INNOVATION AND ENTREPRENEURSHIP, 2018, 3 (03)
  • [37] Social Progress Index for Urban and Rural Areas of a Region: Evidence from Peru
    Inga-Hancco, Maylee
    Indigoyen-Porras, Adamari
    Parra-Alarcon, Sergio
    Cerron-Aliaga, Juan
    Vicente-Ramos, Wagner
    STATISTIKA-STATISTICS AND ECONOMY JOURNAL, 2021, 101 (04): 422-435
  • [38] Dynamic communication and perception of cyber risk: Evidence from big data in media
    Xu, Wei
    Murphy, Finbarr
    Xu, Xian
    Xing, Wenpeng
    COMPUTERS IN HUMAN BEHAVIOR, 2021, 122
  • [39] From Big Bang to Big Data: A History of the Media
    Cramer, Dana
    CANADIAN JOURNAL OF COMMUNICATION, 2024, 49 (04): 654-656
  • [40] Media history: From big bang to big data
    Jonsson, Sverker
    HISTORISK TIDSKRIFT, 2021, 141 (02): 352-354