Evaluation of Nonparametric and Parametric Statistical Procedures for Modeling and Prediction of Cluster-Correlated Hydroclimatic Data

被引:4
|
作者
Singh, Sarmistha [1 ]
Abebe, Ash [2 ]
Srivastava, Puneet [3 ]
机构
[1] Indian Inst Technol, Dept Civil Engn, Palakkad, India
[2] Auburn Univ, Dept Math & Stat, Auburn, AL 36849 USA
[3] Auburn Univ, Biosyst Engn, Auburn, AL 36849 USA
关键词
joint rank fit; Wilcoxon rank sum; restricted maximum likelihood; baseflow; El Nino-Southern Oscillation; EL-NINO/SOUTHERN OSCILLATION; NINO-SOUTHERN-OSCILLATION; REGRESSION-COEFFICIENTS; CLIMATE VARIABILITY; ROBUST ESTIMATION; ENSO INFLUENCES; LINEAR-MODELS; UNITED-STATES; STREAMFLOW; PRECIPITATION;
D O I
10.1029/2017WR022254
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Climate and hydrologic variables such as temperature, precipitation, streamflow, and baseflow generally do not follow Gaussian distribution due to the presence of outliers and heavy tails. Therefore, they are usually analyzed using the nonparametric Wilcoxon rank sum test rather than parametric methods like classical t tests and analysis of variance. Furthermore, in addition to having a non-Gaussian distribution, these data exhibit monthly/seasonal variability, which leads to within month/season cluster correlation. In this study, a nonparametric procedure, called joint rank fit (JRFit), for analyzing cluster-correlated data was implemented and compared against traditional methods such as restricted maximum likelihood, least absolute deviations, and rank-based fit (a model-based extension of Wilcoxon rank sum) for studying the coupled effect of the phases of El Nino-Southern Oscillation and Atlantic Multidecadal Oscillation on baseflow levels. The results from a large Monte Carlo simulation experiment showed that JRFit was more efficient than the other three methods for data with (i) high variability, (ii) outliers due to contamination, or (iii) strong monthly/seasonal correlation. The efficiency gain of JRFit was up to 50% compared to restricted maximum likelihood for heavy-tailed and highly correlated data. Predictive performance evaluated using the mean absolute prediction error and mean prediction standard error from an out-of-sample cross-validation study showed JRFit to be optimal for providing predictions of baseflow on the basis of the phases of El Nino-Southern Oscillation and Atlantic Multidecadal Oscillation. Thus, it is recommended that JRFit be implemented in hydroclimatic studies to provide powerful inference when there is evidence of clustering in the data.
引用
收藏
页码:6948 / 6964
页数:17
相关论文
共 50 条
  • [1] Handbook of parametric and nonparametric statistical procedures
    Austin, E
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2005, 58 : 382 - 382
  • [2] Handbook of parametric and nonparametric statistical procedures
    Severance, N
    CONTEMPORARY PSYCHOLOGY-APA REVIEW OF BOOKS, 1998, 43 (12): : 834 - 835
  • [3] A weighted multivariate sign test for cluster-correlated data
    Larocque, Denis
    Nevalainen, Jaakko
    Oja, Hannu
    BIOMETRIKA, 2007, 94 (02) : 267 - 283
  • [4] A fast kernel independence test for cluster-correlated data
    Song, Hoseung
    Liu, Hongjiao
    Wu, Michael C.
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [5] A fast kernel independence test for cluster-correlated data
    Hoseung Song
    Hongjiao Liu
    Michael C. Wu
    Scientific Reports, 12
  • [6] A note on robust variance estimation for cluster-correlated data
    Williams, RL
    BIOMETRICS, 2000, 56 (02) : 645 - 646
  • [7] Robust inference for sparse cluster-correlated count data
    Hanfelt, John J.
    Li, Ruosha
    Pan, Yi
    Payment, Pierre
    JOURNAL OF MULTIVARIATE ANALYSIS, 2011, 102 (01) : 182 - 192
  • [9] A Kernel-based Test of Independence for Cluster-correlated Data
    Liu, Hongjiao
    Plantinga, Anna M.
    Xiang, Yunhua
    Wu, Michael C.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] On the Analysis of Case-Control Studies in Cluster-correlated Data Settings
    Haneuse, Sebastien
    Rivera-Rodriguez, Claudia
    EPIDEMIOLOGY, 2018, 29 (01) : 50 - 57