New Partially Linear Regression and Machine Learning Models Applied to Agronomic Data

Cited by: 1
Authors
Rodrigues, Gabriela M. [1]
Ortega, Edwin M. M. [1]
Cordeiro, Gauss M. [2]
Affiliations
[1] Univ Sao Paulo, Dept Exact Sci, BR-13418900 Piracicaba, Brazil
[2] Univ Fed Pernambuco, Dept Stat, BR-50670901 Recife, Brazil
Keywords
agronomic experimentation; cross validation; decision tree; maximum likelihood estimation; random forest; residual analysis; classification; tree
DOI
10.3390/axioms12111027
Chinese Library Classification (CLC) number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Regression analysis can be appropriate for describing a nonlinear relationship between a response variable and explanatory variables. This article describes the construction of a partially linear regression model with two systematic components based on the exponentiated odd log-logistic normal distribution. The parameters are estimated by the penalized maximum likelihood method. Simulations for several parameter settings and sample sizes provide empirical evidence of the accuracy of the estimators. The superiority of the proposed regression model over other regression models is shown using agronomic experimentation data. The predictive performance of the new model is compared with that of two machine learning techniques: decision trees and random forests. All three methods achieved similar predictive performance, i.e., none stands out as a better predictor. The choice of method therefore depends on the objective of the research. If the objective is purely predictive, the decision tree can be used because of its simplicity. For inference purposes, the regression model is recommended, since it provides much more information about the relationships among the variables under study.
Pages: 18