NUMERIC PREDICTION OF DISSOLVED OXYGEN STATUS THROUGH TWO-STAGE TRAINING FOR CLASSIFICATION-DRIVEN REGRESSION

Cited by: 1
Authors
Guo, Pengfei [1 ,2 ]
Liu, Han [3 ]
Liu, Shuangyin [2 ,4 ]
Xu, Longqin [2 ,4 ]
Affiliations
[1] Zhongkai Univ Agr & Engn, Coll Computat Sci, Guangzhou 510225, Peoples R China
[2] Guangdong Higher Educ Inst, Intelligent Agr Engn Res Ctr, Guangzhou 510225, Peoples R China
[3] Cardiff Univ, Sch Comp Sci & Informat, Cardiff CF24 3AA, Wales
[4] Zhongkai Univ Agr & Engn, Coll Informat Sci & Technol, Guangzhou 510225, Peoples R China
Source
PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC) | 2019
Funding
National Natural Science Foundation of China;
Keywords
Machine learning; Regression; Dissolved oxygen; NEURAL-NETWORK;
DOI
10.1109/icmlc48188.2019.8949196
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Dissolved oxygen in aquaculture is an important measure of the quality of the culture environment and of how well aquatic products have grown. In the machine learning context, this measure can be obtained by defining a regression problem that aims at numerical prediction of the dissolved oxygen status. In general, however, the vast majority of popular machine learning algorithms were designed for classification tasks. To adopt these algorithms effectively for the above-mentioned numerical prediction, in this paper we propose a two-stage training approach that transforms a regression problem into a classification problem and then transforms it back into a regression problem. In particular, unsupervised discretization of continuous attributes is adopted at the first stage to transform the target (numeric) attribute into a discrete (nominal) one with several intervals, so that popular machine learning algorithms can be used, in the setting of a classification task, to predict the interval to which an instance belongs. Then, based on the classification result of the first stage, some of the instances within the predicted interval are selected for training at the second stage, which performs numerical prediction of the target attribute value of each instance. An experimental study is conducted to investigate the general effectiveness of popular learning algorithms in the numerical prediction task and to analyze how increasing the number of training instances selected at the second stage affects the final prediction performance. The results show that decision tree learning and neural networks lead to better and more stable performance than Naive Bayes, K Nearest Neighbours and Support Vector Machine.
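The two-stage idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which discretization scheme, which instance-selection rule, or which second-stage regressor is used. The sketch assumes equal-width binning for the unsupervised discretization, a k-nearest-neighbour vote as the first-stage classifier, and the mean target of the nearest instances inside the predicted interval as the second-stage estimate. All function names, parameters, and data values are hypothetical.

```python
from collections import Counter


def equal_width_bins(targets, n_bins):
    """Unsupervised discretization (assumed scheme): map each numeric
    target value to the index of an equal-width interval."""
    lo, hi = min(targets), max(targets)
    width = (hi - lo) / n_bins or 1.0  # fall back to 1.0 if all targets are equal
    return [min(int((t - lo) / width), n_bins - 1) for t in targets]


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def predict_interval(X, labels, query, k=3):
    """Stage 1 (assumed classifier): k-nearest-neighbour vote on interval labels."""
    nearest = sorted(range(len(X)), key=lambda i: sq_dist(X[i], query))[:k]
    return Counter(labels[i] for i in nearest).most_common(1)[0][0]


def two_stage_predict(X, y, query, n_bins=3, k=3, n_selected=2):
    """Classify the query into a target interval, then estimate its numeric
    target from selected training instances inside that interval."""
    labels = equal_width_bins(y, n_bins)
    interval = predict_interval(X, labels, query, k)
    # Stage 2 (assumed rule): keep the n_selected instances of the predicted
    # interval that lie closest to the query and average their targets.
    members = [i for i in range(len(X)) if labels[i] == interval]
    members.sort(key=lambda i: sq_dist(X[i], query))
    chosen = members[:n_selected]
    return sum(y[i] for i in chosen) / len(chosen)


# Made-up toy data for illustration only: one feature (water temperature)
# and a dissolved oxygen target. These are not values from the paper.
X_toy = [[20.0], [21.0], [22.0], [28.0], [29.0], [30.0]]
y_toy = [8.0, 7.8, 7.6, 5.0, 4.8, 4.6]
print(two_stage_predict(X_toy, y_toy, [21.4], n_bins=2, k=3, n_selected=2))
```

In this sketch, `n_selected` plays the role of the number of training instances selected at the second stage, which the abstract reports as a factor influencing the final prediction performance.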
Pages: 101-106
Number of pages: 6