Fish-TViT: A novel fish species classification method in multi water areas based on transfer learning and vision transformer

被引:16
|
作者
Gong, Bo [1 ,2 ,3 ]
Dai, Kanyuan [3 ,4 ,5 ]
Shao, Ji [1 ,2 ,4 ]
Jing, Ling [1 ,2 ,3 ,4 ,5 ]
Chen, Yingyi [1 ,2 ,3 ,4 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] China Agr Univ, Natl Innovat Ctr Digital Fishery, Beijing 100083, Peoples R China
[3] Minist Agr & Rural Affairs, Key Lab Smart Farming Technol Aquat Anim & Livesto, Beijing 100083, Peoples R China
[4] China Agr Univ, Beijing Engn & Technol Res Ctr Internet Things Agr, Beijing 100083, Peoples R China
[5] China Agr Univ, Coll Sci, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Fish species classification; Deep learning; Vision transformer; Transfer learning; IDENTIFICATION;
D O I
10.1016/j.heliyon.2023.e16761
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The classification of fish species has important practical significance for both the aquaculture industry and ordinary people. However, existing methods for classifying marine and freshwater fishes have poor feature extraction ability and do not meet actual needs. To address this issue, we propose a novel method for multi-water fish classification (Fish-TViT) based on transfer learning and visual transformers. Fish-TViT uses a label smoothing loss function to solve the problem of overfitting and overconfidence of the classifier. We also employ Gradient-weighted Category Activation Mapping (Grad-CAM) technology to visualize and understand the features of the model and the areas on which the decision depends, which guides the optimization of the model architecture. We first crop and clean fish images, and then use data augmentation to expand the number of training datasets. A pre-trained visual transformer model is used to extract enhanced features of fish images, which are subsequently cropped into a series of flat patches. Finally, a multi-layer perceptron is used to predict fish species. Experimental results show that Fish-TViT achieves high classification accuracy on both low-resolution marine fish data (94.33%) and high-resolution freshwater fish data (98.34%). Compared with traditional convolutional neural networks, Fish-TViT has better performance.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Design of educational method classification model based on improved multi-label transfer learning model
    Zeng, Chanjuan
    Zhao, Chunhui
    SOFT COMPUTING, 2023,
  • [42] Water Quality Prediction Method Based on Multi-Source Transfer Learning for Water Environmental IoT System
    Zhou, Jian
    Wang, Jian
    Chen, Yang
    Li, Xin
    Xie, Yong
    SENSORS, 2021, 21 (21)
  • [43] Predicting the Urban Water Demand Based on Transfer Learning Method With Multi-head Attention
    Chen, Zhuo
    Deng, Chuhan
    Che, Fei
    Li, Yan
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 3760 - 3765
  • [44] An unsupervised transfer learning bearing fault diagnosis method based on multi-channel calibrated Transformer with shiftable window
    Zhi, Shaodan
    Su, Kaiyu
    Yu, Jun
    Li, Xueyi
    Shen, Haikuo
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2025,
  • [45] A Novel Obstacle Traversal Method for Multiple Robotic Fish Based on Cross-Modal Variational Autoencoders and Imitation Learning
    Wang, Ruilong
    Wang, Ming
    Zhao, Qianchuan
    Gong, Yanling
    Zuo, Lingchen
    Zheng, Xuehan
    Gao, He
    BIOMIMETICS, 2024, 9 (04)
  • [46] Text classification based on a novel cost-sensitive ensemble multi-label learning method
    Hu, Haifeng
    Zhang, Tao
    Wu, Jiansheng
    Journal of Software Engineering, 2016, 10 (01): : 42 - 53
  • [47] Data driven deep learning fault diagnosis method based on vision transformer and multi-head attention for different working condition
    Lu, Jingyu
    Ji, Weixi
    Yu, Junjie
    Zhang, Chaoyang
    ENGINEERING RESEARCH EXPRESS, 2025, 7 (01):
  • [48] Enhancing multi-type fault diagnosis in lithium-ion battery systems: Vision transformer-based transfer learning approach
    Liu, Xuyang
    Cai, Hongchang
    Zhou, Zihan
    Kong, Ye
    Zhou, Xingyu
    Han, Xuebing
    Sun, Yuedong
    Zhang, Bowen
    Guo, Dongxu
    Zheng, Yuejiu
    JOURNAL OF POWER SOURCES, 2024, 624
  • [49] A novel method for the detection and classification of multiple diseases using transfer learning-based deep learning techniques with improved performance
    Natarajan, Krishnamoorthy
    Muthusamy, Suresh
    Sha, Mizaj Shabil
    Sadasivuni, Kishor Kumar
    Sekaran, Sreejith
    Charles Gnanakkan, Christober Asir Rajan
    A.Elngar, Ahmed
    Neural Computing and Applications, 2024, 36 (30) : 18979 - 18997
  • [50] Multi-class Classification of Retinal Eye Diseases from Ophthalmoscopy Images Using Transfer Learning-Based Vision Transformers
    Cutur, Elif Setenay
    Inan, Neslihan Gokmen
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2025,