Classifying real-world data with the DDα-procedure

被引:0
|
作者
Mozharovskyi, Pavlo [1 ]
Mosler, Karl [1 ]
Lange, Tatjana [2 ]
机构
[1] Univ Cologne, Albertus Magnus Pl, D-50923 Cologne, Germany
[2] Hsch Merseburg, D-06217 Merseburg, Germany
关键词
Classification; Supervised learning; Alpha-procedure; Data depth; Spatial depth; Projection depth; Random Tukey depth; Outsiders; Features; DATA DEPTH; CLASSIFICATION; REGRESSION;
D O I
10.1007/s11634-014-0180-8
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The -classifier, a nonparametric fast and very robust procedure, is described and applied to fifty classification problems regarding a broad spectrum of real-world data. The procedure first transforms the data from their original property space into a depth space, which is a low-dimensional unit cube, and then separates them by a projective invariant procedure, called -procedure. To each data point the transformation assigns its depth values with respect to the given classes. Several alternative depth notions (spatial depth, Mahalanobis depth, projection depth, and Tukey depth, the latter two being approximated by univariate projections) are used in the procedure, and compared regarding their average error rates. With the Tukey depth, which fits the distributions' shape best and is most robust, 'outsiders', that is data points having zero depth in all classes, appear. They need an additional treatment for classification. Evidence is also given about the dimension of the extended feature space needed for linear separation. The -procedure is available as an R-package.
引用
收藏
页码:287 / 314
页数:28
相关论文
共 50 条
  • [11] Assessing Real-World Data Quality: The Application of Patient Registry Quality Criteria to Real-World Data and Real-World Evidence
    Gliklich, Richard E.
    Leavy, Michelle B.
    THERAPEUTIC INNOVATION & REGULATORY SCIENCE, 2020, 54 (02) : 303 - 307
  • [12] Editorial: Real-world data and real-world evidence in lung cancer
    Gristina, Valerio
    Eze, Chukwuka
    FRONTIERS IN ONCOLOGY, 2024, 14
  • [13] Inaccurate Real-World Data Does Not Provide Real-World Answers
    Buffet, Gabriela
    Mendoza-Sassi, Raul
    Fysekidis, Marinos
    AMERICAN JOURNAL OF THERAPEUTICS, 2021, 28 (05) : E596 - E598
  • [14] For insights into the real world, consider real-world data
    Raoof, Sana
    Kurzrock, Razelle
    SCIENCE TRANSLATIONAL MEDICINE, 2022, 14 (673)
  • [15] Editorial: Real-world data and real-world evidence in hematologic malignancies
    Malagola, Michele
    Ohgami, Robert
    Greco, Raffaella
    FRONTIERS IN ONCOLOGY, 2023, 13
  • [16] When can real-world data generate real-world evidence?
    Rahman, Motiur
    Dal Pan, Gerald
    Stein, Peter
    Levenson, Mark
    Kraus, Stefanie
    Chakravarty, Aloka
    Rivera, Donna R.
    Forshee, Richard
    Concato, John
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2024, 33 (01)
  • [17] Real-World or Controlled Clinical Trial Data in Real-World Practice
    Wu, Ting-Hui
    Yang, James Chih-Hsin
    JOURNAL OF THORACIC ONCOLOGY, 2018, 13 (04) : 470 - 472
  • [18] Learning With Real-World Data
    不详
    IEEE CONTROL SYSTEMS MAGAZINE, 2023, 43 (05): : 158 - 159
  • [19] Reliability of real-world data
    Benlidayi, Ilke Coskun
    RHEUMATOLOGY INTERNATIONAL, 2019, 39 (03) : 583 - 584
  • [20] Real-World Data Modeling
    Kotanchek, Mark
    PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTATION COMPANION (GECCO'12), 2012, : 1349 - 1378