Cluster analysis for the selection of potential discriminatory variables and the identification of subgroups in archaeometry

Cited by: 3
Authors
Lopez-Garcia, Pedro A. [1]
Argote, Denisse L. [2]
Affiliations
[1] Escuela Nacl Antropol Hist, Posgrad Arqueol, Perifer Sur Esq,Calle Zapote,Col Isidro Fabela, Mexico City, Mexico
[2] Inst Nacl Antropol & Hist, Direcc Estudios Arqueol, Tacuba 76,Colonia Ctr, Mexico City, Mexico
Keywords
Archaeological glass; High-dimensional data; Dimensionality reduction; Feature selection; Databionic Swarm; Data visualization; COMPOSITIONAL DATA-ANALYSIS; R PACKAGE; MODEL; GLASS; CLASSIFICATION; KNOWLEDGE; ANTWERP
DOI
10.1016/j.jasrep.2023.104022
Chinese Library Classification
K85 [Cultural relics and archaeology]
Discipline classification code
0601
Abstract
In this article, three variable selection methods based on Gaussian mixture models were compared to find a subset of variables that provided the "best" clustering. The use of an appropriate transformation for compositional data, whose geometric space is the simplex, is emphasized. The comparison showed that the models can cluster data in multiple phases, and that selecting the relevant variables is more convenient than basing the analysis on 2D plots or including all available variables simultaneously in a multivariate analysis. Once the informative variables for the clustering were obtained, we used a method called Databionic Swarm (DBS). This unsupervised machine learning method exploits emergence and swarm intelligence to find natural chemical groups in the input data space. DBS can visualize high-dimensional distances in the projection through a 3D topographic map with hypsometric tints. The results were compared in terms of accuracy, both in the selection of the variables and in the classification, using a supervised accuracy index for clustering and two unsupervised indexes (the heatmap and the silhouette plot). The concepts and methods were illustrated by applying them to two published archaeological glass data sets: the first consisted of 245 Romano-British glass vessels and the second of 180 glass vessels from 15th-17th century Antwerp. In these applications, the variable selection methods increased the accuracy of the classification compared to traditional methods.
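The abstract does not name the transformation used for the compositional data. As an illustration only, the sketch below applies the centered log-ratio (clr) transform, one common way to move data from the simplex to real space before clustering. The function name `clr` and the four-part sample compositions are hypothetical and are not taken from the paper or its data sets.

```python
import numpy as np


def clr(compositions):
    """Centered log-ratio transform of row-wise compositions.

    Assumption: every part is strictly positive. Zeros would need a
    replacement strategy that is not covered in this sketch.
    """
    x = np.asarray(compositions, dtype=float)
    if np.any(x <= 0):
        raise ValueError("clr requires strictly positive parts")
    # Close each row so its parts sum to 1 (a point on the simplex).
    closed = x / x.sum(axis=1, keepdims=True)
    log_x = np.log(closed)
    # Subtracting the row mean of the logs is the same as dividing each
    # part by the row's geometric mean before taking the log.
    return log_x - log_x.mean(axis=1, keepdims=True)


# Hypothetical four-part compositions (weight %), for illustration only.
sample = np.array([
    [70.0, 18.0, 7.0, 5.0],
    [65.0, 20.0, 10.0, 5.0],
])
clr_sample = clr(sample)
```

Each transformed row sums to zero, and the result does not change if a composition is rescaled, which is why transforms of this kind are usually applied before distance-based or model-based clustering of compositional data.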
Pages: 22