Knowledge-Informed Sparse Learning for Relevant Feature Selection and Optimal Quality Prediction

被引:3
|
作者
Liu, Yiren [1 ,2 ]
Qin, S. Joe [1 ,2 ,3 ]
机构
[1] City Univ Hong Kong, Sch Data Sci, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Hong Kong Inst Data Sci, Hong Kong, Peoples R China
[3] Lingnan Univ, Inst Data Sci, Hong Kong, Peoples R China
关键词
Industrial applications; online trend adaption; physically relevant variables; sparse learning; variable selection; VARIABLE SELECTION; INFERENTIAL SENSORS;
D O I
10.1109/TII.2023.3247008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Industrial data are usually collinear, which can cause pure data-driven sparse learning to deselect physically relevant variables and select collinear surrogates. In this article, a novel two-step learning approach to retaining knowledge-informed variables (KIVs) is proposed to build inferential models. The first step is an improved knowledge-informed Lasso (KILasso) algorithm by removing penalty on the KIVs to produce a series of candidate subsets that guarantee the retention of the KIVs. The candidate subsets are then used to run the KILasso or ridge regression again to select the best sets of variables and estimate the final model. Two new algorithms are proposed and applied to datasets from an industrial boiler process and the Dow Chemical challenge problem. It is demonstrated that some important physically relevant variables are deselected by pure data-driven sparse methods, but they are retained using the proposed knowledge-informed methods with superior prediction performance.
引用
收藏
页码:11499 / 11507
页数:9
相关论文
共 50 条
  • [1] Knowledge-Informed Molecular Learning: A Survey on Paradigm Transfer
    Fang, Yin
    Chen, Zhuo
    Fan, Xiaohui
    Zhang, Ningyu
    Chen, Huajun
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, KSEM 2024, 2024, 14884 : 86 - 98
  • [2] Knowledge-Informed Machine Learning for Cancer Diagnosis and Prognosis: A Review
    Mao, Lingchao
    Wang, Hairong
    Hu, Leland S.
    Tran, Nhan L.
    Canoll, Peter D.
    Swanson, Kristin R.
    Li, Jing
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024,
  • [3] Sparse Learning Method With Feature Selection for Sensor Placement and Response Prediction
    Zhang, Minzhao
    Ding, Junliang
    Li, Bin
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (06) : 8022 - 8033
  • [4] Automated Antenna Design via Domain Knowledge-Informed Reinforcement Learning and Imitation Learning
    Wei, Zhaohui
    Zhou, Zhao
    Wang, Peng
    Ren, Jian
    Yin, Yingzeng
    Pedersen, Gert Frolund
    Shen, Ming
    IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2023, 71 (07) : 5549 - 5557
  • [5] X-LDA: An interpretable and knowledge-informed heterogeneous graph learning framework for LncRNA-disease association prediction
    Cao, Yangkun
    Xiao, Jun
    Sheng, Nan
    Qu, Yinwei
    Wang, Zhihang
    Sun, Chang
    Mu, Xuechen
    Huang, Zhenyu
    Li, Xuan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 167
  • [6] Knowledge-Informed Deep Learning Model for Subsurface Thermohaline Reconstruction From Satellite Observations
    Wang, An
    Su, Hua
    Huang, Zhanchao
    Yan, Xiao-Hai
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [7] A Survey on Sparse Learning Models for Feature Selection
    Li, Xiaoping
    Wang, Yadi
    Ruiz, Ruben
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (03) : 1642 - 1660
  • [8] Relevant based structure learning for feature selection
    Zare, Hadi
    Niazi, Mojtaba
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 55 : 93 - 102
  • [9] Optimal Feature Selection and Machine Learning for Prediction of Outcome in Parkinson's Disease
    Salmanpour, Mohammad
    Saberi, Abdollah
    Shamsaei, Mojtaba
    Rahmim, Arman
    JOURNAL OF NUCLEAR MEDICINE, 2020, 61
  • [10] A Novel Domain Knowledge-Informed Machine Learning Approach for Modeling Solid Waste Management Systems
    He, Rui
    Small, Mitchell J.
    Scott, Ian J.
    Olarinre, Motolani
    Sandoval-Reyes, Mexitli
    Ferrao, Paulo
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2023, 57 (46) : 18215 - 18224