A Data Quality Multidimensional Model for Social Media Analysis

被引:0
|
作者
Aramburu, Maria Jose [1 ]
Berlanga, Rafael [2 ]
Lanza-Cruz, Indira [2 ]
机构
[1] Univ Jaume 1, Dept Deengn & Ciencia Comp, Castellon de La Plana 12071, Spain
[2] Univ Jaume 1, Dept Llenguatges & Sistemes Informat, Castellon de La Plana 12071, Spain
关键词
Data quality; Social media data; Business intelligence; Text analytics; DECISION-MAKING; ANALYTICS; CREDIBILITY; TWITTER;
D O I
10.1007/s12599-023-00840-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social media platforms have become a new source of useful information for companies. Ensuring the business value of social media first requires an analysis of the quality of the relevant data and then the development of practical business intelligence solutions. This paper aims at building high-quality datasets for social business intelligence (SoBI). The proposed method offers an integrated and dynamic approach to identify the relevant quality metrics for each analysis domain. This method employs a novel multidimensional data model for the construction of cubes with impact measures for various quality metrics. In this model, quality metrics and indicators are organized in two main axes. The first one concerns the kind of facts to be extracted, namely: posts, users, and topics. The second axis refers to the quality perspectives to be assessed, namely: credibility, reputation, usefulness, and completeness. Additionally, quality cubes include a user-role dimension so that quality metrics can be evaluated in terms of the user business roles. To demonstrate the usefulness of this approach, the authors have applied their method to two separate domains: automotive business and natural disasters management. Results show that the trade-off between quantity and quality for social media data is focused on a small percentage of relevant users. Thus, data filtering can be easily performed by simply ranking the posts according to the quality metrics identified with the proposed method. As far as the authors know, this is the first approach that integrates both the extraction of analytical facts and the assessment of social media data quality in the same framework.
引用
收藏
页码:667 / 689
页数:23
相关论文
共 50 条
  • [11] Quality management architecture for social media data
    Pääkkönen P.
    Jokitulppo J.
    Journal of Big Data, 4 (1)
  • [12] Multidimensional Data Model for Air Pollution Data Analysis
    Doreswamy
    Harishkumar, K. S.
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 1684 - 1689
  • [13] The Semantic Network Model of Creativity: Analysis of Online Social Media Data
    Yu, Feng
    Peng, Theodore
    Peng, Kaiping
    Zheng, Sam Xianjun
    Liu, Zhiyuan
    CREATIVITY RESEARCH JOURNAL, 2016, 28 (03) : 268 - 274
  • [14] Multidimensional Analysis of Hot Events from Social Media Sources
    Troudi, Abir
    Jamoussi, Salma
    Zayani, Corinne Amel
    Amous, Ikram
    SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 2112 - 2119
  • [15] Social Media and Twitter Data Quality for New Social Indicators
    Salvatore, Camilla
    Biffignandi, Silvia
    Bianchi, Annamaria
    SOCIAL INDICATORS RESEARCH, 2021, 156 (2-3) : 601 - 630
  • [16] Social Media and Twitter Data Quality for New Social Indicators
    Camilla Salvatore
    Silvia Biffignandi
    Annamaria Bianchi
    Social Indicators Research, 2021, 156 : 601 - 630
  • [17] Evaluating the Quality of Social Media Data in Big Data Architecture
    Immonen, Anne
    Paakkonen, Pekka
    Ovaska, Eila
    IEEE ACCESS, 2015, 3 : 2028 - 2043
  • [18] An Ontology for Social Media Data Analysis
    Jain, Sarika
    Dalal, Sumit
    Dave, Mayank
    SEMANTIC INTELLIGENCE, ISIC 2022, 2023, 964 : 77 - 87
  • [19] Visual Analysis of Social Media Data
    Schreck, Tobias
    Keim, Daniel
    COMPUTER, 2013, 46 (05) : 68 - 75
  • [20] Towards a Model for the Multidimensional Analysis of Field Data
    Bimonte, Sandro
    Kang, Myoung-Ah
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, 2010, 6295 : 58 - +