Cluster analysis for the selection of potential discriminatory variables and the identification of subgroups in archaeometry

被引:3
|
作者
Lopez-Garcia, Pedro A. [1 ]
Argote, Denisse L. [2 ]
机构
[1] Escuela Nacl Antropol Hist, Posgrad Arqueol, Perifer Sur Esq,Calle Zapote,Col Isidro Fabela, Mexico City, Mexico
[2] Inst Nacl Antropol & Hist, Direcc Estudios Arqueol, Tacuba 76,Colonia Ctr, Mexico City, Mexico
关键词
Archaeological glass; High-dimensional data; Dimensionality reduction; Feature selection; Databionic Swarm; Datavisualization; COMPOSITIONAL DATA-ANALYSIS; R PACKAGE; MODEL; GLASS; CLASSIFICATION; KNOWLEDGE; ANTWERP;
D O I
10.1016/j.jasrep.2023.104022
中图分类号
K85 [文物考古];
学科分类号
0601 ;
摘要
In this article, three variable selection methods based on Gaussian mixture models were compared to find a subset of variables that provided the "best" clustering. The use of an appropriate transformation for composi-tional data, whose geometric space is the Simplex, is emphasized. The comparison revealed the ability of the models to cluster data in multiple phases, showing to be more convenient to select the relevant variables than to perform an analysis based on 2D plots or by simultaneously including all the available variables in a multivariate analysis. Once the informative variables for the clustering were obtained, we used a method called Databionic Swarm (DBS). This method uses unsupervised machine learning, taking advantage of emergence and swarm intelligence applied to find natural chemical groups in the input data space. DBS can visualize high-dimensional distances in the projection through a 3D topographic map with hypsometric tints. The results were compared in terms of accuracy, both in the selection of the variables and in the classification, using a supervised accuracy index for clustering and two unsupervised indexes (the Heatmap and the Silhouette plot). The concepts and methods were illustrated by applying them to two published archaeological glass data sets. The first set consisted of 245 Romano-British glass vessels and the second set of 180 glass vessels from the 15th-17th century in Antwerp. In these applications, it was found that the methods for the selection of variables increased the ac-curacy of the classification compared to traditional methods.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] A cluster selection approach to polynomial NARX identification
    Pulecchi, Tiziano
    Piroddi, Luigi
    2007 AMERICAN CONTROL CONFERENCE, VOLS 1-13, 2007, : 1993 - 1998
  • [42] Cluster Analysis to Identify Possible Subgroups in Tinnitus Patients
    van den Berge, Minke J. C.
    Free, Rolien H.
    Arnold, Rosemarie
    de Kleine, Emile
    Hofman, Rutger
    van Dijk, J. Marc C.
    van Dijk, Pim
    FRONTIERS IN NEUROLOGY, 2017, 8
  • [43] CLUSTER ANALYSIS OF CLINICAL DATA IDENTIFIES FIBROMYALGIA SUBGROUPS
    Docampo, E.
    Escaramis, G.
    Rabionet, R.
    Carbonell, J.
    Rivera, J.
    Alegre, J.
    Vidal, J.
    Estivill, X.
    Collado, A.
    ANNALS OF THE RHEUMATIC DISEASES, 2013, 71 : 76 - 76
  • [44] Identification of potential industrial cluster in the region
    Veza, I.
    Dulcic, Z.
    Gjeldum, N.
    Annals of DAAAM for 2006 & Proceedings of the 17th International DAAAM Symposium: INTELLIGENT MANUFACTURING & AUTOMATION: FOCUS ON MECHATRONICS AND ROBOTICS, 2006, : 429 - 430
  • [45] Identifying cognitive subgroups in bipolar disorder: A cluster analysis
    Lima, Flavia
    Rabelo-da-Ponte, Francisco Diego
    Bucker, Joana
    Czepielewski, Leticia
    Hasse-Sousa, Mathias
    Telesca, Raissa
    Sole, Brisa
    Reinares, Maria
    Vieta, Eduard
    Rosa, Adriane R.
    JOURNAL OF AFFECTIVE DISORDERS, 2019, 246 : 252 - 261
  • [46] The use of sensitivity analysis for selection of decision variables in machine tool dynamic models identification
    Gutowski, P
    Berczynski, S
    COMPUTATIONAL METHODS AND EXPERIMENTAL MEASUREMENTS X, 2001, 3 : 565 - 574
  • [47] Cluster Analysis of Clinical Data Identifies Fibromyalgia Subgroups
    Docampo, Elisa
    Collado, Antonio
    Escaramis, Georgia
    Carbonell, Jordi
    Rivera, Javier
    Vidal, Javier
    Alegre, Jose
    Rabionet, Raquel
    Estivill, Xavier
    PLOS ONE, 2013, 8 (09):
  • [48] Subgroups of children with autism by cluster analysis: A longitudinal examination
    Stevens, MC
    Fein, DA
    Dunn, M
    Allen, D
    Waterhouse, LH
    Feinstein, C
    Rapin, I
    JOURNAL OF THE AMERICAN ACADEMY OF CHILD AND ADOLESCENT PSYCHIATRY, 2000, 39 (03): : 346 - 352
  • [49] Automated margin analysis of contemporary adhesive systems in vitro: Evaluation of discriminatory variables
    Heintzea, Siegward D.
    Forjanic, Monika
    Roulet, Jean-Francois
    JOURNAL OF ADHESIVE DENTISTRY, 2007, 9 (04): : 359 - 369
  • [50] Identification Of Clinical Asthma Phenotypes By Using Cluster Analysis With Simple Measurable Variables In Japanese Population
    Sakagami, T.
    Hasegawa, T.
    Koya, T.
    Furukawa, T.
    Kawakami, H.
    Hoshino, Y.
    Kimura, Y.
    Sakamoto, H.
    Suzuki, E.
    Narita, I.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2011, 183