Multi-feature language-image model for fruit quality image classification

Authors
Duan, Jie-li [1]
Lai, Li-qian [1,3]
Yang, Zhou [1,2]
Luo, Zhi-jian [3]
Yuan, Hao-tian [1]
Affiliations
[1] South China Agr Univ, Coll Engn, Guangzhou 510642, Guangdong, Peoples R China
[2] Guangdong Ocean Univ, Sch Mech Engn, Zhanjiang 524088, Guangdong, Peoples R China
[3] Jiaying Univ, Coll Comp Sci, Meizhou 514015, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Artificial intelligence; Fruit quality classification; Language-image model; Few-shot learning
DOI
10.1016/j.compag.2024.109462
Chinese Library Classification (CLC) code
S [Agricultural sciences]
Discipline classification code
09
Abstract
Fruit quality classification has a great impact on the modern fruit industry. However, deep learning methods for fruit quality classification often demand a substantial number of labeled samples, which are hard and expensive to collect in many real-world applications; with too few samples, models overfit and generalize poorly. The Contrastive Language-Image Pre-Training (CLIP) model, which fuses image and text features, has demonstrated excellent performance in zero-shot classification. Inspired by CLIP, we propose a multi-feature language-image (MFLI) model for fruit quality classification, in which the fruit image and feature text are fused to enhance feature extraction. Furthermore, we construct a pomelo quality dataset containing first- and second-grade pomelos. Based on the zero-shot learning results of CLIP on this dataset, we provide recommendations for the pre-prompt and multi-feature text. Experimental results show that in zero-shot, few-shot, and conventional learning scenarios, our MFLI model outperforms state-of-the-art models on seven types of fruits, demonstrating excellent generalization capabilities.
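To make the CLIP-style zero-shot step mentioned in the abstract concrete, the following is a minimal sketch and not the authors' MFLI implementation. It assumes each class is described by a pre-prompt plus a feature text, and that an image encoder and a text encoder have already produced embeddings. The prompt wording, the three-dimensional toy embeddings, the temperature value, and the names `cosine` and `zero_shot_probs` are all hypothetical and chosen only for illustration.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def zero_shot_probs(image_emb, text_embs, temperature=0.01):
    """Softmax over temperature-scaled image-text cosine similarities."""
    logits = [cosine(image_emb, t) / temperature for t in text_embs]
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical pre-prompt and per-class feature texts (not from the paper).
PRE_PROMPT = "a photo of a"
CLASS_TEXTS = {
    "first-grade": "first-grade pomelo with smooth, evenly colored peel",
    "second-grade": "second-grade pomelo with blemished, unevenly colored peel",
}
prompts = [f"{PRE_PROMPT} {text}" for text in CLASS_TEXTS.values()]

# Toy embeddings standing in for real encoder outputs (assumption):
# one text embedding per prompt, plus one image embedding.
text_embs = [[0.9, 0.1, 0.2], [0.1, 0.9, 0.3]]
image_emb = [0.8, 0.2, 0.1]

probs = zero_shot_probs(image_emb, text_embs)
labels = list(CLASS_TEXTS)
predicted = labels[probs.index(max(probs))]
print(predicted, probs)
```

In a real pipeline the toy vectors would be replaced by the outputs of pretrained image and text encoders applied to the fruit image and to each entry of `prompts`; the scoring step itself would stay the same.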
Pages: 13