Adverse Media Classification: A New Era of Risk Management with XGBoost and Gradient Boosting Algorithms

被引:0
|
作者
Juliandri, Reza [1 ]
Johan, Monika Evelin [1 ]
Wiratama, Jansen [1 ]
Sanjaya, Samuel Ady [1 ]
机构
[1] Univ Multimedia Nusantara, Informat Syst, Tangerang, Indonesia
关键词
adverse media; classification; gradient boosting; website; XGBoost;
D O I
10.1109/IBDAP62940.2024.10689708
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adverse media is negative information that is not profitable for businesses or individuals, while adverse media classification is the process of classifying news titles that are included in adverse media. In an effort to create a system capable of mitigating the occurrence of fraud for customer satisfaction, machine learning is used to classify news both as detrimental media and not for the selection of news for the customer due diligence system. This study utilizes the XGBoost and Gradient Boosting algorithms to classify news headlines. A data set of 1,281 records was collected from NewsAPI and web scraping. Back translation is used in the data preparation stage to deal with unbalanced data sets and create text variants Grid search is used to find the best hyperparameters for Gradient Boosting and XGBoost. The results of the research are in the form of a machine-learning model. Across all models examined, Gradient Boosting trained on 753 records performed best with an accuracy rate of 82.31% on test data and 84.93% on validation data. This model is able to be used to classify media and then implemented in a web-based interface.
引用
收藏
页码:18 / 21
页数:4
相关论文
共 50 条
  • [21] Advertising Management Strategies of Chinese Urban Newspaper in the New Media Era
    Ma, Meigui
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY AND MANAGEMENT SCIENCE (ITMS 2015), 2015, 34 : 714 - 716
  • [22] Research on Campus Emergencies Management of Private Universities in New Media Era
    Zhao, Yan
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON SOCIAL SCIENCE, EDUCATION AND HUMANITIES RESEARCH (ICSEHR 2017), 2017, 152 : 133 - 135
  • [23] Management Strategies of Corporate Crisis Public Relations in the New Media Era
    Zhang Xianhui
    2013 FOURTH INTERNATIONAL CONFERENCE ON DIGITAL MANUFACTURING AND AUTOMATION (ICDMA), 2013, : 756 - 759
  • [24] New Algorithms for the Prediction of Cardiovascular Risk The Post-Diamond-Forrester Era
    Rozanski, Alan
    Berman, Daniel S.
    JAMA CARDIOLOGY, 2017, 2 (04) : 359 - 360
  • [25] An effective adaptive customization framework for small manufacturing plants using extreme gradient boosting-XGBoost and random forest ensemble learning algorithms in an Industry 4.0 environment
    Kiangala, Sonia Kahiomba
    Wang, Zenghui
    MACHINE LEARNING WITH APPLICATIONS, 2021, 4 (04):
  • [26] New boosting algorithms for classification problems with large number of classes applied to a handwritten word recognition task
    Günter, S
    Bunke, H
    MULTIPLE CLASSIFIER SYSTEMS, PROCEEDING, 2003, 2709 : 326 - 335
  • [27] Algorithms vs. human nature: The research on the group polarisation phenomenon in the new media era
    Peng, Yiting
    Wang, Ting
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2024, 59 : 167 - 167
  • [28] Introduction to Knowledge Management in an Era of New Social Media: Risks and Opportunities Minitrack
    Mason, Robert M.
    Ford, Dianne P.
    PROCEEDINGS OF THE 46TH ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, 2013, : 3603 - 3603
  • [29] INCREASED CORONARY PERFORATION IN THE NEW DEVICE ERA - INCIDENCE, CLASSIFICATION, MANAGEMENT, AND OUTCOME
    ELLIS, SG
    AJLUNI, S
    ARNOLD, AZ
    POPMA, JJ
    BITTL, JA
    EIGLER, NL
    COWLEY, MJ
    RAYMOND, RE
    SAFIAN, RD
    WHITLOW, PL
    CIRCULATION, 1994, 90 (06) : 2725 - 2730
  • [30] INCREASED CORONARY PERFORATION IN THE NEW DEVICE ERA - INCIDENCE, CLASSIFICATION, MANAGEMENT AND OUTCOME
    ELLIS, SG
    ARNOLD, AZ
    RAYMOND, RE
    EIGLER, NL
    SANBORN, TA
    BITTL, JA
    LINCOFF, AM
    MOONEY, MR
    TCHENG, JE
    TOPOL, EJ
    CIRCULATION, 1992, 86 (04) : 787 - 787