A Novel Approach using Vision Transformers (VIT) for Classification of Holes Drilled in Melamine Faced Chipboard

被引:0
|
作者
Bukowski, Michal [1 ]
Jegorowa, Albina [2 ]
Kurek, Jaroslaw [1 ]
机构
[1] Warsaw Univ Life Sci, Inst Informat Technol, Dept Artificial Intelligence, Warsaw, Poland
[2] Warsaw Univ Life Sci, Inst Wood Sci & Furniture, Dept Mech Proc Wood, Warsaw, Poland
来源
PRZEGLAD ELEKTROTECHNICZNY | 2024年 / 100卷 / 05期
关键词
Vision Transformer; Convolutional Neural Network; tool state monitoring; melamine faced chipboard;
D O I
10.15199/48.2024.05.52
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a comprehensive performance evaluation of various AI architectures for a classification of holes drilled in melamine faced chipboard, including custom Convolutional Neural Network (CNN-designed), five-fold CNN-designed, VGG19, single and five-fold VGG16, an ensemble of CNN-designed, VGG19, and 5xVGG16, and Vision Transformers (ViT). Each model's performance was measured and compared based on their classification accuracy, with the Vision Transformer models, particularly the B_32 model trained for 8000 epochs, demonstrating superior performance with an accuracy of 71.14%. Despite this achievement, the study underscores the need to balance model performance with other considerations such as computational resources, model complexity, and training times. The results highlight the importance of careful model selection and fine-tuning, guided not only by performance metrics but also by the specific requirements and constraints of the task and context. The study provides a strong foundation for further exploration into other transformer-based models and encourages deeper investigations into model fine-tuning to harness the full potential of these AI architectures for image classification tasks.
引用
收藏
页码:273 / 276
页数:4
相关论文
共 50 条
  • [31] Adaptive Knowledge Distillation for Classification of Hand Images Using Explainable Vision Transformers
    Thanh Thi Nguyen
    Wilson, Campbell
    Dalins, Janis
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES-RESEARCH TRACK AND DEMO TRACK, PT VIII, ECML PKDD 2024, 2024, 14948 : 235 - 252
  • [32] A Novel Approach to Age Classification from Hand Dorsal Images using Computer Vision
    Chakrabarty, Navoneel
    Chatterjee, Subhrasankar
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 198 - 202
  • [33] A novel approach to the classification of the transient phenomena in power transformers using combined wavelet transform and neural network
    Mao, PLL
    Aggarwal, RK
    IEEE TRANSACTIONS ON POWER DELIVERY, 2001, 16 (04) : 654 - 660
  • [34] A Novel Approach for Calibration of Instrument Transformers using Synchrophasors
    Hinge, Trupti P.
    Dambhare, Sanjay. S.
    2016 NATIONAL POWER SYSTEMS CONFERENCE (NPSC), 2016,
  • [35] AnisotropicBreast-ViT: Breast Cancer Classification in Ultrasound Images Using Anisotropic Filtering and Vision Transformer
    Diniz, Joao Otavio Bandeira
    Ribeiro, Neilson P.
    Dias, Domingos A., Jr.
    da Cruz, Luana B.
    da Silva, Giovanni L. F.
    Gomes, Daniel L., Jr.
    de Paiva, Anselmo C.
    Silva, Aristofanes C.
    INTELLIGENT SYSTEMS, BRACIS 2024, PT III, 2025, 15414 : 95 - 109
  • [36] Towards Understanding Cat Vocalizations: A Novel Cat Sound Classification Model Based on Vision Transformers
    Kucukkulahli, Enver
    Kabakus, Abdullah Talha
    APPLIED ACOUSTICS, 2024, 226
  • [37] Through-Ice Acoustic Source Tracking Using Vision Transformers with Ordinal Classification
    Whitaker, Steven
    Barnard, Andrew
    Anderson, George D.
    Havens, Timothy C.
    SENSORS, 2022, 22 (13)
  • [38] Automatic classification of ultrasound thyroids images using vision transformers and generative adversarial networks
    Jerbi, Feres
    Aboudi, Noura
    Khlifa, Nawres
    SCIENTIFIC AFRICAN, 2023, 20
  • [39] Classification of Brain Tumor from Magnetic Resonance Imaging Using Vision Transformers Ensembling
    Tummala, Sudhakar
    Kadry, Seifedine
    Bukhari, Syed Ahmad Chan
    Rauf, Hafiz Tayyab
    CURRENT ONCOLOGY, 2022, 29 (10) : 7498 - 7511
  • [40] ETLoViT: an acne diagnose approach using vision-transformers and model ensembling
    Paluri, Krishna Veni
    Gupta, Ashish
    ENGINEERING RESEARCH EXPRESS, 2024, 6 (03):