A label noise filtering method for regression based on adaptive threshold and noise score

被引:9
|
作者
Li, Chuang [1 ]
Mao, Zhizhong [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
关键词
Noise filter; Real-valued label noise; Adaptive noise determination; Noise score; Ensemble filtering; Iterative filtering; CLASSIFICATION; PERFORMANCE; SELECTION; PREDICTION; RANKING; FUSION; TESTS; SET;
D O I
10.1016/j.eswa.2023.120422
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The quality of training data plays a decisive role in the establishment of intelligent models. Since raw data obtained from the real world are usually entwined with noise due to variety of causes, noise filtering has become an important aspect of machine learning techniques. In contrast with the extensive research conducted on noise elimination for classification purposes, papers addressing this problem for regression tasks are rather scarce. In this paper, we propose a novel noise filter to clean noisy instances with real-valued label noise. Aiming at the deficiency of the existing noise determination criterion, a new adaptive threshold-based method is first proposed. It allows a noisy instance to be adaptively defined according to the fitting difficulty levels of different datasets, and areas with different densities. Embedded with this criterion, an effective noise filtering procedure is also designed. An ensemble filtering scheme and an iterative filtering process are combined to detect as many po-tential noisy samples as possible from the original training set. According to the acquire noise detection infor-mation, a noise score for evaluating the noise level is specifically developed. The potential noisy samples whose scores exceed a reasonable threshold are further filtered, which can compensate for the possible errors incurred during the previous procedure, and contribute to more reliable filtering results. The validity of the proposed method is studied in exhaustive experiments. We discuss reasonable hyperparameters, and compare the devel-oped method with several state-of-the-art noise filters. The outcomes show that the prediction accuracy of the utilized regressor can greatly benefit from preprocessing the given raw dataset by using our method. Simulta-neously, the method is able to acquire a good balance between the elimination of noisy samples and the retention of clean samples, and consistently achieves a better noise filtering performance.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Removing of Salt-and-pepper Noise in Images Based on Adaptive Median Filtering and Improved Threshold Function
    Qian, Ying
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 1431 - 1436
  • [22] A new robust adaptive algorithm based adaptive filtering for noise cancellation
    Awwab Qasim Jumaah Althahab
    Analog Integrated Circuits and Signal Processing, 2018, 94 : 217 - 231
  • [23] A new robust adaptive algorithm based adaptive filtering for noise cancellation
    Althahab, Awwab Qasim Jumaah
    ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2018, 94 (02) : 217 - 231
  • [24] A new adaptive filtering method for removing salt and pepper noise based on multilayered PCNN
    Deng, Xiangyu
    Ma, Yide
    Dong, Min
    PATTERN RECOGNITION LETTERS, 2016, 79 : 8 - 17
  • [25] Adaptive Filtering-based Heavy-Noise Removal in Born Iterative Method
    Quang-Huy, Tran
    Theu, Luong Thi
    Minh, Nguyen Canh
    Tran, Duc-Nghia
    Tran, Duc-Tan
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 2041 - 2045
  • [26] ADAPTIVE FOURIER THRESHOLD FILTERING - A METHOD TO REDUCE NOISE AND INCOHERENT ARTIFACTS IN HIGH-RESOLUTION CARDIAC IMAGES
    DOYLE, M
    CHAPMAN, BLW
    BLACKWELL, G
    WALSH, EG
    POHOST, GM
    MAGNETIC RESONANCE IN MEDICINE, 1994, 31 (05) : 546 - 550
  • [27] Enhanced Label Noise Filtering with Multiple Voting
    Guan, Donghai
    Hussain, Maqbool
    Yuan, Weiwei
    Khattak, Asad Masood
    Fahim, Muhammad
    Khan, Wajahat Ali
    APPLIED SCIENCES-BASEL, 2019, 9 (23):
  • [28] Gesture image segmentation with Otsu's method based on noise adaptive angle threshold
    Xiao, Leyi
    Ouyang, Honglin
    Fan, Chaodong
    Umer, Tariq
    Poonia, Ramesh Chandra
    Wan, Shaohua
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (47-48) : 35619 - 35640
  • [29] Gesture image segmentation with Otsu’s method based on noise adaptive angle threshold
    Leyi Xiao
    Honglin Ouyang
    Chaodong Fan
    Tariq Umer
    Ramesh Chandra Poonia
    Shaohua Wan
    Multimedia Tools and Applications, 2020, 79 : 35619 - 35640
  • [30] Cluster Validation Measures for Label Noise Filtering
    Boeva, Veselka
    Lundberg, Lars
    Angelova, Milena
    Kohstall, Jan
    2018 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS (IS), 2018, : 109 - 116