Knowledge-Informed Sparse Learning for Relevant Feature Selection and Optimal Quality Prediction

Cited by: 3
Authors
Liu, Yiren [1, 2]
Qin, S. Joe [1, 2, 3]
Affiliations
[1] City Univ Hong Kong, Sch Data Sci, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Hong Kong Inst Data Sci, Hong Kong, Peoples R China
[3] Lingnan Univ, Inst Data Sci, Hong Kong, Peoples R China
Keywords
Industrial applications; online trend adaption; physically relevant variables; sparse learning; variable selection; inferential sensors
DOI
10.1109/TII.2023.3247008
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Industrial data are usually collinear, which can cause pure data-driven sparse learning to deselect physically relevant variables and select collinear surrogates. In this article, a novel two-step learning approach to retaining knowledge-informed variables (KIVs) is proposed to build inferential models. The first step is an improved knowledge-informed Lasso (KILasso) algorithm by removing penalty on the KIVs to produce a series of candidate subsets that guarantee the retention of the KIVs. The candidate subsets are then used to run the KILasso or ridge regression again to select the best sets of variables and estimate the final model. Two new algorithms are proposed and applied to datasets from an industrial boiler process and the Dow Chemical challenge problem. It is demonstrated that some important physically relevant variables are deselected by pure data-driven sparse methods, but they are retained using the proposed knowledge-informed methods with superior prediction performance.
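The idea of removing the penalty on the KIVs can be illustrated with a small sketch. The code below is not the authors' KILasso algorithm; it is a minimal coordinate-descent Lasso, written under the assumption that "removing penalty" means giving the KIV columns a zero L1 weight. The function name `ki_lasso`, the parameter `kiv_idx`, the synthetic data, and the grid of `alphas` are all hypothetical choices made for illustration.

```python
import numpy as np


def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold (the Lasso proximal step)."""
    return np.sign(value) * max(abs(value) - threshold, 0.0)


def ki_lasso(X, y, kiv_idx, alpha, n_iter=500, tol=1e-8):
    """Coordinate descent for a Lasso whose L1 penalty skips the columns
    listed in kiv_idx (an assumed reading of 'removing penalty on the KIVs').

    Minimizes (1 / (2n)) * ||y - X b||^2 + alpha * sum_j w_j * |b_j|,
    with w_j = 0 for KIV columns and w_j = 1 otherwise.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, p = X.shape
    Xc = X - X.mean(axis=0)          # center so the intercept drops out
    yc = y - y.mean()
    weights = np.ones(p)
    weights[list(kiv_idx)] = 0.0     # no penalty on knowledge-informed variables
    col_sq = (Xc ** 2).sum(axis=0) / n
    beta = np.zeros(p)
    residual = yc.copy()
    for _ in range(n_iter):
        max_change = 0.0
        for j in range(p):
            if col_sq[j] == 0.0:
                continue             # constant column carries no information
            old = beta[j]
            rho = Xc[:, j] @ residual / n + col_sq[j] * old
            new = soft_threshold(rho, alpha * weights[j]) / col_sq[j]
            if new != old:
                residual -= Xc[:, j] * (new - old)
                beta[j] = new
                max_change = max(max_change, abs(new - old))
        if max_change < tol:
            break
    intercept = y.mean() - X.mean(axis=0) @ beta
    return beta, intercept


# Synthetic collinear data (hypothetical): column 0 drives y, column 1 is a
# near-copy of column 0, columns 2-4 are unrelated noise.
rng = np.random.default_rng(0)
x0 = rng.normal(size=200)
X = np.column_stack([x0, x0 + 0.05 * rng.normal(size=200),
                     rng.normal(size=(200, 3))])
y = 2.0 * x0 + 0.1 * rng.normal(size=200)

# With a heavy penalty, a plain Lasso drops every variable, while the
# knowledge-informed variant still keeps column 0.
beta_plain, _ = ki_lasso(X, y, kiv_idx=[], alpha=10.0)
beta_ki, _ = ki_lasso(X, y, kiv_idx=[0], alpha=10.0)

# Sweeping the penalty yields candidate subsets that all contain the KIV.
alphas = [1.0, 0.3, 0.1, 0.03]
candidate_subsets = []
for a in alphas:
    b, _ = ki_lasso(X, y, kiv_idx=[0], alpha=a)
    support = tuple(int(j) for j in np.flatnonzero(b))
    if support not in candidate_subsets:
        candidate_subsets.append(support)
```

In a second step along the lines the abstract describes, each entry of `candidate_subsets` could be refitted (for example with ridge regression) and compared on held-out data; that step is left out here to keep the sketch short.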
Pages: 11499-11507
Page count: 9