Multimodal Chinese Agricultural News Classification Method Based on Interactive Attention

Cited by: 1
Authors
Duan, Xuliang [1 ]
Li, Zhiyao [2 ]
Liu, Lingqi
Liu, Yuhai
Affiliations
[1] Sichuan Agr Univ, Sch Informat Engn, Yaan 625014, Sichuan, Peoples R China
[2] Key Lab Agr Informat Engn Sichuan Prov, Yaan 625014, Sichuan, Peoples R China
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Feature extraction; Fake news; Data models; Agricultural machinery; Visualization; Training; Attention mechanisms; Semantics; Fisheries; Annotations; Multimedia computing; Multimodal learning; multimodal classification; multimodal Chinese agricultural news dataset; interactive attention mechanism; attention mechanism; feature fusion; Chinese agricultural news classification; Chinese agricultural news
DOI
10.1109/ACCESS.2024.3482868
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Most current research on Chinese agricultural news is limited to text analysis and seldom integrates images, which has left multimodal Chinese agricultural news datasets scarce and multimodal research on such news underdeveloped. To address this, we propose VECO, a multimodal Chinese agricultural news classification method based on interactive attention mechanisms. The method uses ERNIE for text feature extraction and ViT (Vision Transformer) for image feature extraction, and models the interplay of features across modalities to uncover the congruent emotional content present in both the images and the text. The resulting interactive features are merged with the individual image and text features and passed through a softmax layer to produce the classification result. Experiments on an in-house multimodal Chinese agricultural news dataset show that VECO outperforms the baseline model, with improvements of 3.27% in precision, 0.59% in recall, and 1.92% in F1-score. Multimodal classification of Chinese agricultural news performs better than text-only classification, and VECO performs notably better than the other multimodal classification models compared. Future research can focus on optimizing the multimodal feature fusion algorithm to adapt to more complex agricultural news scenarios.
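The fusion pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: random vectors stand in for ERNIE token features and ViT patch features, the attention is plain scaled dot-product attention without learned projections, and the mean pooling, the linear classifier, and every function and variable name are assumptions made for illustration only.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross_attend(queries, keys_values):
    """Each query vector attends over the other modality's vectors
    (scaled dot-product attention; an assumed stand-in for the
    paper's interactive attention)."""
    scale = math.sqrt(len(queries[0]))
    attended = []
    for q in queries:
        weights = softmax([dot(q, k) / scale for k in keys_values])
        attended.append([
            sum(w * v[i] for w, v in zip(weights, keys_values))
            for i in range(len(q))
        ])
    return attended


def mean_pool(vectors):
    """Average a list of equal-length vectors into one vector."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]


def classify(text_feats, image_feats, weight, bias):
    # Interactive attention in both directions: text attends to image
    # features, and image attends to text features.
    text_to_image = cross_attend(text_feats, image_feats)
    image_to_text = cross_attend(image_feats, text_feats)
    interactive = mean_pool(text_to_image) + mean_pool(image_to_text)
    # Merge the interactive features with the individual text and image
    # features (list concatenation), then apply a linear layer and softmax.
    fused = interactive + mean_pool(text_feats) + mean_pool(image_feats)
    logits = [dot(row, fused) + b for row, b in zip(weight, bias)]
    return softmax(logits)


# Hypothetical sizes: 5 text tokens, 3 image patches, 8-dim features, 4 classes.
rng = random.Random(0)
dim, num_classes = 8, 4
text_feats = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(5)]
image_feats = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(3)]
weight = [[rng.gauss(0, 0.1) for _ in range(4 * dim)] for _ in range(num_classes)]
bias = [0.0] * num_classes

probs = classify(text_feats, image_feats, weight, bias)
predicted_class = max(range(num_classes), key=lambda c: probs[c])
```

In this sketch the fused vector has four times the feature dimension (two interactive halves plus the pooled text and image features), which is why the untrained classifier weight has `4 * dim` columns. A real system would replace the random inputs with encoder outputs and learn the attention projections and classifier end to end.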
Pages: 161718-161731
Number of pages: 14