Quality Assessment and Biases in Reused Data

被引:3
|
作者
Fernandez-Ardevo, Mireia [1 ,2 ]
Rosales, Andrea [1 ,2 ]
机构
[1] Univ Oberta Catalunya UOC, Fac Informat & Commun Sci, Barcelona, Catalonia, Spain
[2] Univ Oberta Catalunya UOC, IN3 Internet Interdisciplinary Inst, Barcelona, Catalonia, Spain
关键词
data quality; data biases; reused data; reused traces; open data; online behavioral advertising;
D O I
10.1177/00027642221144855
中图分类号
B849 [应用心理学];
学科分类号
040203 ;
摘要
This article investigates digital and non-digital traces reused beyond the context of creation. A central idea of this article is that no (reused) dataset is perfect. Therefore, data quality assessment becomes essential to determine if a given dataset is "good enough" to be used to fulfill the users' goals. Biases, a possible source of discrimination, have become a relevant data challenge. Consequently, it is appropriate to analyze whether quality assessment indicators provide information on potential biases in the dataset. We use examples representing two opposing sides regarding data access to reflect on the relationship between quality and bias. First, the European Union open data portal fosters the democratization of data and expects users to manipulate the databases directly to perform their analyses. Second, online behavioral advertising systems offer individualized promotional services but do not share the datasets supporting their design. Quality assessment is socially constructed, as there is not a universal definition but a set of quality dimensions, which might change for each professional context. From the users' perspective, trust/credibility stands out as a relevant quality dimension in the two analyzed cases. Results show that quality indicators (whatever they are) provide limited information on potential biases. We suggest that data literacy is most needed among both open data users and clients of behavioral advertising systems. Notably, users must (be able to) understand the limitations of datasets for an optimal and bias-free interpretation of results and decision-making.
引用
收藏
页码:696 / 710
页数:15
相关论文
共 50 条
  • [1] Quality assessment of treated wastewater to be reused in agriculture
    Rahimi, M. H.
    Kalantari, N.
    Sharifidoost, M.
    Kazemi, M.
    GLOBAL JOURNAL OF ENVIRONMENTAL SCIENCE AND MANAGEMENT-GJESM, 2018, 4 (02): : 217 - 230
  • [2] Assessment of reused catheters
    ASAIO J, 3 (M611):
  • [3] Uses and Biases of Volunteer Water Quality Data
    Loperfido, J. V.
    Beyer, Pieter
    Just, Craig L.
    Schnoor, Jerald L.
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2010, 44 (19) : 7193 - 7199
  • [4] A Web-Based Tool to Evaluate Data Quality of Reused Health Data Assets
    Wendl, Christopher
    Duftschmid, Georg
    Gezgin, Deniz
    Popper, Niki
    Miksch, Florian
    Rinner, Christoph
    HEALTH INFORMATICS MEETS EHEALTH: DIGITAL INSIGHT - INFORMATION-DRIVEN HEALTH & CARE, 2017, 236 : 204 - 210
  • [5] SANITARY QUALITY OF BROILER LITTER REUSED
    Vieira, Maria de F. A.
    Tinoco, Ilda de F. F.
    dos Santos, Bernadete M.
    Inoue, Keles R. A.
    Mendes, Mucio A. dos S. A.
    ENGENHARIA AGRICOLA, 2015, 35 (05): : 800 - 807
  • [6] Disregarding multimappers leads to biases in the functional assessment of NGS data
    da Paz, Michelle Almeida
    Warger, Sarah
    Taher, Leila
    BMC GENOMICS, 2024, 25 (01)
  • [7] Quality assessment of microarrays: Visualization of spatial artifacts and quantitation of regional biases
    Mark Reimers
    John N Weinstein
    BMC Bioinformatics, 6
  • [8] Quality assessment of microarrays: Visualization of spatial artifacts and quantitation of regional biases
    Reimers, M
    Weinstein, JN
    BMC BIOINFORMATICS, 2005, 6 (1)
  • [9] Synthetic Data and Computer-Vision-Based Automated Quality Inspection System for Reused Scaffolding
    Kim, Alexander
    Lee, Kyuhyup
    Lee, Seojoon
    Song, Jinwoo
    Kwon, Soonwook
    Chung, Suwan
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [10] A study of quality management strategy for reused products
    Lo, Hui-Chiung
    Yu, Rouh-Yun
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2013, 119 : 172 - 177