Transformer-based Pouranic topic classification in Indian mythology

被引:0
|
作者
Paul, Apurba [1 ,3 ]
Seal, Srijan [2 ]
Das, Dipankar [1 ]
机构
[1] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
[2] JIS Coll Engn, Dept Comp Sci & Engn, Kalyani, India
[3] Univ Engn & Management, Inst Engn & Management, Dept Comp Sci & Engn, Kolkata, India
关键词
Topic classification; Indian mythology; transformer models; semantic similarity; log-likelihood; Pouranic;
D O I
10.1007/s12046-024-02598-6
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Topic classification is a challenging task in order to comprehend the subject matter or theme of the Indian mythology. It will enhance the performance of NLP-based systems, such as recommendation and semantic search engines, when dealing with texts containing mythology. This research focuses on developing transformer based models for automated topic classification of Indian mythological documents, which addresses the challenges of organizing and analyzing this rich and diverse corpus. We introduce PouranicTopic, a new annotated dataset containing over 200k verses from 7 major Hindu texts with canto, topic, and sentence labels. Additional datasets Similarity-based and Log-likelihood-based are created using sentence clustering techniques. The BERT, RoBERTa, and DistilBERT models are evaluated for canto and topic classification on these datasets. Clustering greatly improves the results on the Similarity-based dataset, but Log-likelihood-based dataset remains challenging.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Vision Transformer-Based Ensemble Learning for Hyperspectral Image Classification
    Liu, Jun
    Guo, Haoran
    He, Yile
    Li, Huali
    REMOTE SENSING, 2023, 15 (21)
  • [32] Transformer-Based Fused Attention Combined with CNNs for Image Classification
    Jielin Jiang
    Hongxiang Xu
    Xiaolong Xu
    Yan Cui
    Jintao Wu
    Neural Processing Letters, 2023, 55 : 11905 - 11919
  • [33] Online Feature Classification and Clustering for Transformer-based Visual Tracker
    Zou, Zhuojun
    Hao, Jie
    Shu, Lin
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3514 - 3521
  • [34] Transformer-Based Fused Attention Combined with CNNs for Image Classification
    Jiang, Jielin
    Xu, Hongxiang
    Xu, Xiaolong
    Cui, Yan
    Wu, Jintao
    NEURAL PROCESSING LETTERS, 2023, 55 (09) : 11905 - 11919
  • [35] Transformer-based networks over tree structures for code classification
    Wei Hua
    Guangzhong Liu
    Applied Intelligence, 2022, 52 : 8895 - 8909
  • [36] Classification and recognition of gesture EEG signals with Transformer-Based models
    Qu, Yan
    Li, Congsheng
    Jiang, Haoyu
    2024 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, ARTIFICIAL INTELLIGENCE AND INTELLIGENT CONTROL, RAIIC 2024, 2024, : 415 - 418
  • [37] Transformer-Based BiLSTM for Aspect-Level Sentiment Classification
    Cai, Tao
    Yu, Baocheng
    Xu, Wenxia
    2021 4TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION ENGINEERING (RCAE 2021), 2021, : 138 - 142
  • [38] Transformer-based networks over tree structures for code classification
    Hua, Wei
    Liu, Guangzhong
    APPLIED INTELLIGENCE, 2022, 52 (08) : 8895 - 8909
  • [39] A hierarchical transformer-based network for multivariate time series classification
    Tang, Yingxia
    Wei, Yanxuan
    Li, Teng
    Zheng, Xiangwei
    Ji, Cun
    INFORMATION SYSTEMS, 2025, 132
  • [40] Transformer-Based Spiking Neural Networks for Multimodal Audiovisual Classification
    Guo, Lingyue
    Gao, Zeyu
    Qu, Jinye
    Zheng, Suiwu
    Jiang, Runhao
    Lu, Yanfeng
    Qiao, Hong
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (03) : 1077 - 1086