Quality Assessment and Biases in Reused Data

被引:3
|
作者
Fernandez-Ardevo, Mireia [1 ,2 ]
Rosales, Andrea [1 ,2 ]
机构
[1] Univ Oberta Catalunya UOC, Fac Informat & Commun Sci, Barcelona, Catalonia, Spain
[2] Univ Oberta Catalunya UOC, IN3 Internet Interdisciplinary Inst, Barcelona, Catalonia, Spain
关键词
data quality; data biases; reused data; reused traces; open data; online behavioral advertising;
D O I
10.1177/00027642221144855
中图分类号
B849 [应用心理学];
学科分类号
040203 ;
摘要
This article investigates digital and non-digital traces reused beyond the context of creation. A central idea of this article is that no (reused) dataset is perfect. Therefore, data quality assessment becomes essential to determine if a given dataset is "good enough" to be used to fulfill the users' goals. Biases, a possible source of discrimination, have become a relevant data challenge. Consequently, it is appropriate to analyze whether quality assessment indicators provide information on potential biases in the dataset. We use examples representing two opposing sides regarding data access to reflect on the relationship between quality and bias. First, the European Union open data portal fosters the democratization of data and expects users to manipulate the databases directly to perform their analyses. Second, online behavioral advertising systems offer individualized promotional services but do not share the datasets supporting their design. Quality assessment is socially constructed, as there is not a universal definition but a set of quality dimensions, which might change for each professional context. From the users' perspective, trust/credibility stands out as a relevant quality dimension in the two analyzed cases. Results show that quality indicators (whatever they are) provide limited information on potential biases. We suggest that data literacy is most needed among both open data users and clients of behavioral advertising systems. Notably, users must (be able to) understand the limitations of datasets for an optimal and bias-free interpretation of results and decision-making.
引用
收藏
页码:696 / 710
页数:15
相关论文
共 50 条
  • [21] A Linked Data Quality Assessment Framework for Network Data
    To, Alex
    Meymandpour, Rouzbeh
    Davis, Joseph G.
    Jourjon, Guillaume
    Chan, Jonathan
    PROCEEDINGS OF THE 2ND ACM SIGMOD JOINT INTERNATIONAL WORKSHOP ON GRAPH DATA MANAGEMENT EXPERIENCES & SYSTEMS (GRADES) AND NETWORK DATA ANALYTICS (NDA) 2019, 2019,
  • [22] Method for Data Quality Assessment of Synthetic Industrial Data
    Iantovics, Laszlo Barna
    Enachescu, Calin
    SENSORS, 2022, 22 (04)
  • [23] Data relevance and data quality for ecological risk assessment
    Breton, Roger L.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2014, 248
  • [24] Big Data Quality Assessment Model for Unstructured Data
    Taleb, Ikbal
    Serhani, Mohamed Adel
    Dssouli, Rachida
    PROCEEDINGS OF THE 2018 13TH INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION TECHNOLOGY (IIT), 2018, : 69 - 74
  • [25] A data driven learning approach for the assessment of data quality
    Erik Tute
    Nagarajan Ganapathy
    Antje Wulff
    BMC Medical Informatics and Decision Making, 21
  • [26] On the Importance of Data Quality Assessment of Crowdsourced Meteorological Data
    Vuckovic, Milena
    Schmidt, Johanna
    SUSTAINABILITY, 2023, 15 (08)
  • [27] Organizing Data Quality Assessment of Shifting Biomedical Data
    Saez, Carlos
    Martinez-Miranda, Juan
    Robles, Montserrat
    Miguel Garcia-Gomez, Juan
    QUALITY OF LIFE THROUGH QUALITY OF INFORMATION, 2012, 180 : 721 - 725
  • [28] A data driven learning approach for the assessment of data quality
    Tute, Erik
    Ganapathy, Nagarajan
    Wulff, Antje
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)
  • [29] Statistical data analysis in quality assessment
    Ohsumi, N
    JOURNAL OF THE FOOD HYGIENIC SOCIETY OF JAPAN, 1999, 40 (02): : J214 - J221
  • [30] Customized Quality Assessment of Healthcare Data
    Shin, Jieun
    Kim, Jong-Yeup
    ANNALS OF LABORATORY MEDICINE, 2024, 44 (06) : 472 - 477