The R Package Ecosystem for Robust Statistics

Author
Todorov, Valentin [1]
Affiliation
[1] United Nations Industrial Development Organization (UNIDO), Vienna, Austria
Source
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS | 2024, Vol. 16, No. 6
Keywords
high dimensions; multivariate; outlier; R; robust; PRINCIPAL COMPONENT ANALYSIS; PROJECTION-PURSUIT APPROACH; MULTIVARIATE LOCATION; OUTLIER DETECTION; FAST ALGORITHM; REGRESSION; ESTIMATORS; COVARIANCE; DISPERSION; SCATTER;
DOI
10.1002/wics.70007
Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
In the last few years, the number of R packages implementing different robust statistical methods has increased substantially. There are now numerous packages for computing robust multivariate location and scatter, for robust multivariate analysis such as principal component analysis and discriminant analysis, for robust linear models, and for other algorithms designed to cope with outliers and other irregularities in the data. This abundance of package options may be overwhelming for both beginners and more experienced R users. Here we provide an overview of the 25 most important R packages for different tasks. As metrics for the importance of each package, we consider its maturity and history, its total and average monthly downloads from CRAN (the Comprehensive R Archive Network), and its number of reverse dependencies. We then briefly describe what each of these packages does. After that, we elaborate on the above-mentioned topics of robust statistics, presenting the methodology and its implementation in R and illustrating the application with real data examples. Particular attention is paid to robust methods and algorithms suitable for high-dimensional data. The code for all examples is accessible on the GitHub repository.
Pages: 30