On Machine Learning with Imbalanced Data and Research Quality Evaluation Methodologies

被引:0
|
作者
Lipitakis, Anastasia-Dimitra [1 ]
Lipitakis, Evangelia A. E. C. [2 ]
机构
[1] Univ Patras, Dept Math, Patras 26504, Hellas, Greece
[2] Univ Kent, Kent Business Sch, Canterbury CT2 7PE, Kent, England
关键词
Bibliometric Indicators; Business Intelligence; Citation Analysis; Computational Intelligence; Data Mining; Learning Algorithms; Imbalanced Data; Machine Learning; Quantitative Methods; Research Quality Evaluation; FIRM PERFORMANCE; ROTATION FOREST; E-BUSINESS; CLASSIFICATION; INTELLIGENCE; PREDICTION; STRATEGY;
D O I
10.1109/CSCI.2014.81
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article a synoptic review of machine learning techniques with imbalanced data and a class of corresponding learning algorithms is presented. This class of algorithms includes the meta-algorithms: Cost sensitive, Metacost, Rotation forest-cost sensitive, rotation forest-smote. Four learning algorithms (with base classifiers J48 and part processing with F-measure and a predetermined imbalanced data set) are compared in the computational environment WEKA leading to comparative numerical results. The basic concepts of research quality evaluation methodologies are presented, an adaptive citation qualitative-quantitative approach and advanced bibliometric indicators are given. Basic components of research quality performance such as research journal cited publications, citing publications and research quality evaluations at various academic levels are considered and corresponding numerical results are given. An alternative approach using certain machine learning algorithms with imbalanced data in the case of research quality evaluation methodologies is proposed.
引用
收藏
页码:451 / 457
页数:7
相关论文
共 50 条
  • [1] Machine learning for mining imbalanced data
    Arafat, Md. Yasir
    Hoque, Sabera
    Xu, Shuxiang
    Farid, Dewan Md
    IAENG International Journal of Computer Science, 2019, 46 (02) : 332 - 348
  • [2] Imbalanced generative sampling of training data for improving quality of machine learning model
    Coskun, Umut Can
    Dogan, Kemal Mert
    Gunpinar, Erkan
    ADVANCED ENGINEERING INFORMATICS, 2024, 62
  • [3] Imbalanced Data Problem in Machine Learning: A Review
    Altalhan, Manahel
    Algarni, Abdulmohsen
    Alouane, Monia Turki-Hadj
    IEEE ACCESS, 2025, 13 : 13686 - 13699
  • [4] Machine Learning on Imbalanced Data in Credit Risk
    Birla, Shiivong
    Kohli, Kashish
    Dutta, Akash
    7TH IEEE ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE IEEE IEMCON-2016, 2016,
  • [5] Integrating Data Selection and Extreme Learning Machine for Imbalanced Data
    Mahdiyah, Umi
    Irawan, M. Isa
    Imah, Elly Matul
    INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2015), 2015, 59 : 221 - 229
  • [6] Data Evaluation and Enhancement for Quality Improvement of Machine Learning
    Chen, Haihua
    Chen, Jiangping
    Ding, Junhua
    IEEE TRANSACTIONS ON RELIABILITY, 2021, 70 (02) : 831 - 847
  • [7] Data Evaluation and Enhancement for Quality Improvement of Machine Learning
    Chen, Haihua
    Chen, Jiangping
    Ding, Junhua
    2020 IEEE 20TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY (QRS 2020), 2020, : 13 - 13
  • [8] Adversarial Approaches to Tackle Imbalanced Data in Machine Learning
    Ayoub, Shahnawaz
    Gulzar, Yonis
    Rustamov, Jaloliddin
    Jabbari, Abdoh
    Reegu, Faheem Ahmad
    Turaev, Sherzod
    SUSTAINABILITY, 2023, 15 (09)
  • [9] Evolutionary Online Machine Learning from Imbalanced Data
    Stein, Anthony
    2016 IEEE 1ST INTERNATIONAL WORKSHOPS ON FOUNDATIONS AND APPLICATIONS OF SELF* SYSTEMS (FAS*W), 2016, : 281 - 286
  • [10] A comparative analysis of machine learning techniques for imbalanced data
    Mrad, Ali Ben
    Lahiani, Amine
    Mefteh-Wali, Salma
    Mselmi, Nada
    ANNALS OF OPERATIONS RESEARCH, 2024,