Discrimination Aware Classification for Imbalanced Datasets

被引:0
|
作者
Ristanoski, Goce [1 ]
Liu, Wei [2 ]
Bailey, James [1 ]
机构
[1] Univ Melbourne, NICTA Victoria Lab, Melbourne, Vic, Australia
[2] Univ Melbourne, NICTA ATP Lab, Melbourne, Vic, Australia
关键词
Discrimination aware classification; imbalanced datasets;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of learning a discrimination aware model has recently received attention in the data mining community. Various methods and improved models have been proposed, with the main approach being the detection of a discrimination sensitive attribute. Once the discrimination sensitive attribute is identified, the methods aim to develop a strategy that will include the useful information from that attribute without causing any additional discrimination. Our work focuses on an aspect often overlooked in the discrimination aware classification - the scenario of an imbalanced dataset, where the number of samples from one class is disproportionate to the other. We also investigate a strategy that is directly minimizing discrimination and is independent of the class balance. Our empirical results indicate additional concerns that need to be considered when developing discrimination aware classifiers, and our proposed strategy shows promise in overcoming these concerns.
引用
收藏
页码:1529 / 1532
页数:4
相关论文
共 50 条
  • [31] Improving the Performance of Sentiment Classification on Imbalanced Datasets With Transfer Learning
    Xiao, Z.
    Wang, L.
    Du, J. Y.
    IEEE ACCESS, 2019, 7 : 28281 - 28290
  • [32] Binary classification of imbalanced datasets: The case of CoIL challenge 2000
    Darzi, Mohammad Rasoul Khalilpour
    Niaki, Seyed Taghi Akhavan
    Khedmati, Majid
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 128 (169-186) : 169 - 186
  • [33] RSMOTE: improving classification performance over imbalanced medical datasets
    Mehdi Naseriparsa
    Ahmed Al-Shammari
    Ming Sheng
    Yong Zhang
    Rui Zhou
    Health Information Science and Systems, 8
  • [34] Preprocessing compensation techniques for improved classification of imbalanced medical datasets
    Wosiak, Agnieszka
    Karbowiak, Sylwia
    PROCEEDINGS OF THE 2017 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2017, : 203 - 211
  • [35] Effects of the Use of Boosting on Classification Performance of Imbalanced Bioinformatics Datasets
    Khoshgoftaar, Taghi M.
    Fazelpour, Alireza
    Dittman, David J.
    Napolitano, Amri
    2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2014, : 420 - 426
  • [36] Improving SVM Classification on Imbalanced Datasets by Introducing a New Bias
    Nunez, Haydemar
    Gonzalez-Abril, Luis
    Angulo, Cecilio
    JOURNAL OF CLASSIFICATION, 2017, 34 (03) : 427 - 443
  • [37] KerMinSVM for imbalanced datasets with a case study on arabic comics classification
    Nayal, Ammar
    Jomaa, Hadi
    Awad, Marlette
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 59 : 159 - 169
  • [38] GradMix for Nuclei Segmentation and Classification in Imbalanced Pathology Image Datasets
    Doan, Tan Nhu Nhat
    Kim, Kyungeun
    Song, Boram
    Kwak, Jin Tae
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 171 - 180
  • [39] Robustness of Image Classification on Imbalanced Datasets Using Capsules Networks
    Onana, Steve
    Tchuani, Diane
    Tinku, Claude
    Fippo, Louis
    Kouamou, Georges Edouard
    RESEARCH IN COMPUTER SCIENCE, CRI 2023, 2024, 2085 : 53 - 68
  • [40] Improving SVM Classification on Imbalanced Datasets by Introducing a New Bias
    Haydemar Núñez
    Luis Gonzalez-Abril
    Cecilio Angulo
    Journal of Classification, 2017, 34 : 427 - 443