Multimodal Chinese Agricultural News Classification Method Based on Interactive Attention

Cited by: 1
Authors
Duan, Xuliang [1 ]
Li, Zhiyao [2 ]
Liu, Lingqi
Liu, Yuhai
Affiliations
[1] Sichuan Agr Univ, Sch Informat Engn, Yaan 625014, Sichuan, Peoples R China
[2] Key Lab Agr Informat Engn Sichuan Prov, Yaan 625014, Sichuan, Peoples R China
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Feature extraction; Fake news; Data models; Agricultural machinery; Visualization; Training; Attention mechanisms; Semantics; Fisheries; Annotations; Multimedia computing; Multimodal learning; multimodal classification; multimodal Chinese agricultural news dataset; interactive attention mechanism; attention mechanism; feature fusion; Chinese agricultural news classification; Chinese agricultural news;
DOI
10.1109/ACCESS.2024.3482868
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Most current research on Chinese agricultural news is limited to text analysis and seldom integrates images, leading to a scarcity of multimodal Chinese agricultural news datasets and an evident gap in multimodal Chinese agricultural news research. To address this, we propose the VECO method, a novel multimodal Chinese agricultural news classification approach that leverages interactive attention mechanisms. The method uses ERNIE for text feature extraction and ViT (Vision Transformer) for image feature extraction, focusing on the interplay of features across modalities to uncover the congruent emotional content present in both the images and text. The integrated features are merged with the individual image and text features and subsequently processed through a softmax layer to determine the classification outcomes. Our experiments, conducted on an in-house multimodal Chinese agricultural news dataset, demonstrate that the VECO method outperforms the baseline model, with improvements of 3.27% in precision, 0.59% in recall, and 1.92% in F1-score. The multimodal classification of Chinese agricultural news yields superior performance compared to text-only classification, and the results of the VECO model are notably better than those of other multimodal classification models. Future research can focus on optimizing the multimodal feature fusion algorithm to adapt to more complex agricultural news scenarios.
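The fusion pipeline described in the abstract can be illustrated with a small sketch. This is not the authors' VECO implementation, whose architecture the abstract does not specify in detail. It assumes that the interactive attention is plain scaled dot-product cross-attention in both directions, that features are mean-pooled, and that the merge is a concatenation. Random vectors stand in for the ERNIE token features and ViT patch features, the classifier weights are untrained, and every name and dimension (`cross_attention`, `classify`, `dim`, `num_classes`) is hypothetical.

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    return [list(col) for col in zip(*m)]

def softmax(v):
    """Numerically stable softmax over a list of floats."""
    peak = max(v)
    exps = [math.exp(x - peak) for x in v]
    total = sum(exps)
    return [e / total for e in exps]

def cross_attention(queries, keys_values):
    """Scaled dot-product attention in which one modality queries the other."""
    scale = math.sqrt(len(queries[0]))
    scores = matmul(queries, transpose(keys_values))
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, keys_values)

def mean_pool(m):
    """Average a sequence of vectors into a single feature vector."""
    return [sum(col) / len(m) for col in zip(*m)]

def classify(text_feats, image_feats, w, b):
    # Text tokens attend to image patches, and image patches attend to text tokens.
    text_to_image = mean_pool(cross_attention(text_feats, image_feats))
    image_to_text = mean_pool(cross_attention(image_feats, text_feats))
    # Merge the interactive features with the individual modality features
    # (concatenation is an assumption of this sketch).
    fused = text_to_image + image_to_text + mean_pool(text_feats) + mean_pool(image_feats)
    logits = [sum(wi * f for wi, f in zip(row, fused)) + bi for row, bi in zip(w, b)]
    return softmax(logits)

rng = random.Random(0)
dim, num_classes = 8, 4  # hypothetical sizes, chosen only for illustration
text_feats = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(5)]   # stand-in for ERNIE token features
image_feats = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(6)]  # stand-in for ViT patch features
w = [[rng.gauss(0, 0.1) for _ in range(4 * dim)] for _ in range(num_classes)]  # untrained weights
b = [0.0] * num_classes

probs = classify(text_feats, image_feats, w, b)
predicted = max(range(num_classes), key=probs.__getitem__)
```

Here `probs` is a probability distribution over the hypothetical news categories and `predicted` is the index of the most likely one. In a real system the stand-in features would come from pretrained encoders and the weights would be learned.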
Pages: 161718 - 161731
Page count: 14
Related papers
50 in total
  • [41] Interactive Fusion Network with Recurrent Attention for Multimodal Aspect-based Sentiment Analysis
    Wang, Jun
    Wang, Qianlong
    Wen, Zhiyuan
    Liang, Xingwei
    Xu, Ruifeng
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III, 2022, 13606 : 298 - 309
  • [42] Multimodal Fusion Method Based on Self-Attention Mechanism
    Zhu, Hu
    Wang, Ze
    Shi, Yu
    Hua, Yingying
    Xu, Guoxia
    Deng, Lizhen
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020
  • [43] Multimodal Material Classification Using Visual Attention
    Maleki, Mohadeseh
    Rouhafzay, Ghazal
    Cretu, Ana-Maria
    SENSORS, 2024, 24 (23)
  • [44] Multimodal Keyless Attention Fusion for Video Classification
    Long, Xiang
    Gan, Chuang
    de Melo, Gerard
    Liu, Xiao
    Li, Yandong
    Li, Fu
    Wen, Shilei
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7202 - 7209
  • [45] Classification technique of chinese agricultural text information based on SVM
    College of Information and Electrical Engineering, China Agricultural University, Beijing
    100083, China
    Unknown
    100097, China
    Nongye Jixie Xuebao, (174-179):
  • [46] Named Entity Recognition of Chinese Agricultural Text Based on Attention Mechanism
    Zhao, Pengfei
    Zhao, Chunjiang
    Wu, Huarui
    Wang, Wei
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2021, 52 (01): : 185 - 192
  • [47] A Method of Educational News Classification Based on Emotional Dictionary
    Wang, Bin
    Gao, Linli
    An, Tao
    Meng, Mei
    Zhang, Tong
    PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 3547 - 3551
  • [48] A Hierarchy Method Based on LDA and SVM for News Classification
    Cui, Limeng
    Meng, Fan
    Shi, Yong
    Li, Minqiang
    Liu, An
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 60 - 64
  • [49] Multimodal Taste Classification of Chinese Recipe Based on Image and Text Fusion
    Chen Yawei
    Cao Min
    Gao Wenjing
    2020 5TH INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA 2020), 2020, : 203 - 208
  • [50] Multimodal fusion sensitive information classification based on mixed attention and CLIP model
    Huang, Shuaina
    Zhang, Zhiyong
    Song, Bin
    Mao, Yueheng
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 12425 - 12437