Discrimination Aware Classification for Imbalanced Datasets

被引:0
|
作者
Ristanoski, Goce [1 ]
Liu, Wei [2 ]
Bailey, James [1 ]
机构
[1] Univ Melbourne, NICTA Victoria Lab, Melbourne, Vic, Australia
[2] Univ Melbourne, NICTA ATP Lab, Melbourne, Vic, Australia
关键词
Discrimination aware classification; imbalanced datasets;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of learning a discrimination aware model has recently received attention in the data mining community. Various methods and improved models have been proposed, with the main approach being the detection of a discrimination sensitive attribute. Once the discrimination sensitive attribute is identified, the methods aim to develop a strategy that will include the useful information from that attribute without causing any additional discrimination. Our work focuses on an aspect often overlooked in the discrimination aware classification - the scenario of an imbalanced dataset, where the number of samples from one class is disproportionate to the other. We also investigate a strategy that is directly minimizing discrimination and is independent of the class balance. Our empirical results indicate additional concerns that need to be considered when developing discrimination aware classifiers, and our proposed strategy shows promise in overcoming these concerns.
引用
收藏
页码:1529 / 1532
页数:4
相关论文
共 50 条
  • [1] To improve classification of imbalanced datasets
    Shukla, Pratyusha
    Bhowmick, Kiran
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
  • [2] HAR: Hardness Aware Reweighting for Imbalanced Datasets
    Duggal, Rahul
    Freitas, Scott
    Dhamnani, Sunny
    Chau, Duen Horng
    Sun, Jimeng
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 735 - 745
  • [3] Classification of Antimicrobial Peptides with Imbalanced Datasets
    Camacho, Francy L.
    Torres, Rodrigo
    Ramos Pollan, Raul
    11TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, 2015, 9681
  • [4] A robust loss function for classification with imbalanced datasets
    Wang, Yidan
    Yang, Liming
    NEUROCOMPUTING, 2019, 331 : 40 - 49
  • [5] Imbalanced classification in sparse and large behaviour datasets
    Jellis Vanhoeyveld
    David Martens
    Data Mining and Knowledge Discovery, 2018, 32 : 25 - 82
  • [6] FLSOM with Different Rates for Classification in Imbalanced Datasets
    Machon-Gonzalez, Ivan
    Lopez-Garcia, Hilario
    ARTIFICIAL NEURAL NETWORKS - ICANN 2008, PT I, 2008, 5163 : 642 - 651
  • [7] Imbalanced classification in sparse and large behaviour datasets
    Vanhoeyveld, Jellis
    Martens, David
    DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (01) : 25 - 82
  • [8] Categorical classifiers in multiclass classification with imbalanced datasets
    Carpita, Maurizio
    Golia, Silvia
    STATISTICAL ANALYSIS AND DATA MINING-AN ASA DATA SCIENCE JOURNAL, 2023, 16 (04): : 391 - 405
  • [9] Impact of imbalanced datasets on ML algorithms for malware classification
    Mittal, Palak
    Lallie, Harjinder Singh
    Titis, Elzbieta
    INFORMATION SECURITY JOURNAL, 2025,
  • [10] Comparison of Evaluation Metrics in Classification Applications with Imbalanced Datasets
    Fatourechi, Mehrdad
    Ward, Rabab K.
    Mason, Steven G.
    Huggins, Jane
    Schloegl, Alois
    Birch, Gary E.
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 777 - +