Distributed defect recognition on steel surfaces using an improved random forest algorithm with optimal multi-feature-set fusion

被引:48
|
作者
Wang, Yalin [1 ]
Xia, Haibing [1 ]
Yuan, Xiaofeng [1 ]
Li, Ling [1 ]
Sun, Bei [1 ]
机构
[1] Cent South Univ, Sch Informat Sci & Engn, Changsha, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Steel surface; Distributed defect recognition; Histogram of oriented gradient (HOG); Gray-level co-occurrence matrix (GLCM); Random forest (RF); Optimal multi-feature-set fusion (OMFF); CLASSIFICATION; SYSTEM; IMAGE; HOG;
D O I
10.1007/s11042-017-5238-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Inspecting steel surfaces is important to ensure steel quality. Numerous defect-detection methods have been developed for steel surfaces. However, they are primarily used for local defects, and their accuracy in detecting distributed defects is unsatisfactory because such defects are difficult to locate and have complex texture characteristics. To solve these issues, an improved random forest algorithm with optimal multi-feature-set fusion (OMFF-RF algorithm) is proposed for distributed defect recognition in this paper. The OMFF-RF algorithm includes the following three aspects. First, a histogram of oriented gradient (HOG) feature-set and a gray-level co-occurrence matrix (GLCM) feature-set are extracted and fused to describe local and global texture characteristics, respectively. Second, given the small number of samples of distributed defect images and the high dimensionality of the extracted feature-sets, a random forest algorithm is introduced to perform defect classification. Third, the feature-sets vary greatly in performance and dimensionality. To improve the fusion efficiency, OMFF-RF merges the HOG feature-set and the GLCM feature-set through a multi-feature-set fusion factor, which changes the number of decision trees that correspond to each feature-set in the RF algorithm. The OMFF factor is found by optimizing the fitting curve of the classification accuracy of the test set using a stepping multi-feature-set fusion factor. In experiments, the effectiveness of the proposed OMFF-RF was verified using 5 types of distributed defects collected from an actual steel production line. OMFF-RF achieved a recognition accuracy of 91%, a result superior to support vector machine (SVM) and conventional RF algorithms.
引用
收藏
页码:16741 / 16770
页数:30
相关论文
共 34 条
  • [31] Evaluating total inorganic nitrogen in coastal waters through fusion of multi-temporal RADARSAT-2 and optical imagery using random forest algorithm
    Liu, Meiling
    Liu, Xiangnan
    Li, Jin
    Ding, Chao
    Jiang, Jiale
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2014, 33 : 192 - 202
  • [32] Multi-objective optimal dispatching of combined cooling, heating and power using hybrid gravitational search algorithm and random forest regression: Towards the microgrid orientation
    Nazir, Muhammad Shahzad
    Almasoudi, Fahad M.
    Abdalla, Ahmad N.
    Zhu, Chang
    Alatawi, Khaled Saleem S.
    ENERGY REPORTS, 2023, 9 : 1926 - 1936
  • [33] Multi-view feature fusion for rolling bearing fault diagnosis using random forest and autoencoder; [基于随机森林和自编码的滚动轴承多视角特征融合]
    Sun W.
    Deng A.
    Deng M.
    Zhu J.
    Zhai Y.
    Cheng Q.
    Liu Y.
    Journal of Southeast University (English Edition), 2019, 35 (03): : 302 - 309
  • [34] Multi-class Recognition of Alzheimer's and Parkinson's diseases using Bag of Deep reduced Features (BoDrF) with Improved Chaotic Multi Verse Harris Hawks Optimization (CMVHHO) and Random Forest (RF) based classification for early diagnosis
    Balaji, Chetan
    Suresh, D. S.
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (03): : 774 - 785