Discrimination Aware Classification for Imbalanced Datasets

被引:0
|
作者
Ristanoski, Goce [1 ]
Liu, Wei [2 ]
Bailey, James [1 ]
机构
[1] Univ Melbourne, NICTA Victoria Lab, Melbourne, Vic, Australia
[2] Univ Melbourne, NICTA ATP Lab, Melbourne, Vic, Australia
关键词
Discrimination aware classification; imbalanced datasets;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of learning a discrimination aware model has recently received attention in the data mining community. Various methods and improved models have been proposed, with the main approach being the detection of a discrimination sensitive attribute. Once the discrimination sensitive attribute is identified, the methods aim to develop a strategy that will include the useful information from that attribute without causing any additional discrimination. Our work focuses on an aspect often overlooked in the discrimination aware classification - the scenario of an imbalanced dataset, where the number of samples from one class is disproportionate to the other. We also investigate a strategy that is directly minimizing discrimination and is independent of the class balance. Our empirical results indicate additional concerns that need to be considered when developing discrimination aware classifiers, and our proposed strategy shows promise in overcoming these concerns.
引用
收藏
页码:1529 / 1532
页数:4
相关论文
共 50 条
  • [21] RSMOTE: improving classification performance over imbalanced medical datasets
    Naseriparsa, Mehdi
    Al-Shammari, Ahmed
    Sheng, Ming
    Zhang, Yong
    Zhou, Rui
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2020, 8 (01)
  • [22] A contemporary feature selection and classification framework for imbalanced biomedical datasets
    Bikku, Thulasi
    Nandam, Sambasiva Rao
    Akepogu, Ananda Rao
    EGYPTIAN INFORMATICS JOURNAL, 2018, 19 (03) : 191 - 198
  • [23] Kernel-Based SMOTE for SVM Classification of Imbalanced Datasets
    Mathew, Josey
    Luo, Ming
    Pang, Chee Khiang
    Chan, Hian Leng
    IECON 2015 - 41ST ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2015, : 1127 - 1132
  • [24] An efficient classification approach in imbalanced datasets for intrinsic plagiarism detection
    Andrianna Polydouri
    Eleni Vathi
    Georgios Siolas
    Andreas Stafylopatis
    Evolving Systems, 2020, 11 : 503 - 515
  • [25] Imbalanced datasets classification by fuzzy rule extraction and genetic algorithms
    Soler, Vicenc
    Cerquides, Jesus
    Sabria, Josep
    Roig, Jordi
    Prim, Marta
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 330 - 334
  • [26] Balanced Sampling Meets Imbalanced Datasets in SAR Image Classification
    Jahan, Chowdhury Sadman
    Savakis, Andreas
    GEOSPATIAL INFORMATICS XIII, 2023, 12525
  • [27] An efficient classification approach in imbalanced datasets for intrinsic plagiarism detection
    Polydouri, Andrianna
    Vathi, Eleni
    Siolas, Georgios
    Stafylopatis, Andreas
    EVOLVING SYSTEMS, 2020, 11 (03) : 503 - 515
  • [28] GUM: A Guided Undersampling Method to Preprocess Imbalanced Datasets for Classification
    Sung, Kisuk
    Brown, W. Eric
    Moreno-Centeno, Erick
    Ding, Yu
    2022 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2022, : 1086 - 1091
  • [29] Empirical Study of Sampling Methods for Classification in Imbalanced Clinical Datasets
    Kasem, Asem
    Ghaibeh, A. Ammar
    Moriguchi, Hiroki
    COMPUTATIONAL INTELLIGENCE IN INFORMATION SYSTEMS, CIIS 2016, 2017, 532 : 152 - 162
  • [30] An improved Support Vector Machine for the classification of imbalanced biological datasets
    Wang, Haiying
    Zheng, Huiru
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2008, 5226 : 63 - +