Transformer-based Pouranic topic classification in Indian mythology

被引:0
|
作者
Paul, Apurba [1 ,3 ]
Seal, Srijan [2 ]
Das, Dipankar [1 ]
机构
[1] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
[2] JIS Coll Engn, Dept Comp Sci & Engn, Kalyani, India
[3] Univ Engn & Management, Inst Engn & Management, Dept Comp Sci & Engn, Kolkata, India
关键词
Topic classification; Indian mythology; transformer models; semantic similarity; log-likelihood; Pouranic;
D O I
10.1007/s12046-024-02598-6
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Topic classification is a challenging task in order to comprehend the subject matter or theme of the Indian mythology. It will enhance the performance of NLP-based systems, such as recommendation and semantic search engines, when dealing with texts containing mythology. This research focuses on developing transformer based models for automated topic classification of Indian mythological documents, which addresses the challenges of organizing and analyzing this rich and diverse corpus. We introduce PouranicTopic, a new annotated dataset containing over 200k verses from 7 major Hindu texts with canto, topic, and sentence labels. Additional datasets Similarity-based and Log-likelihood-based are created using sentence clustering techniques. The BERT, RoBERTa, and DistilBERT models are evaluated for canto and topic classification on these datasets. Clustering greatly improves the results on the Similarity-based dataset, but Log-likelihood-based dataset remains challenging.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] A Temporal Transformer-Based Fusion Framework for Morphological Arrhythmia Classification
    Anjum, Nafisa
    Sathi, Khaleda Akhter
    Hossain, Md. Azad
    Dewan, M. Ali Akber
    COMPUTERS, 2023, 12 (03)
  • [42] Transformer-Based Composite Language Models for Text Evaluation and Classification
    Skoric, Mihailo
    Utvic, Milos
    Stankovic, Ranka
    MATHEMATICS, 2023, 11 (22)
  • [43] A transformer-based deep neural network model for SSVEP classification
    Chen, Jianbo
    Zhang, Yangsong
    Pan, Yudong
    Xu, Peng
    Guan, Cuntai
    NEURAL NETWORKS, 2023, 164 : 521 - 534
  • [44] Development of a Text Classification Framework using Transformer-based Embeddings
    Yeasmin, Sumona
    Afrin, Nazia
    Saif, Kashfia
    Huq, Mohammad Rezwanul
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2022, : 74 - 82
  • [45] TransFGVC: transformer-based fine-grained visual classification
    Shen, Longfeng
    Hou, Bin
    Jian, Yulei
    Tu, Xisong
    Zhang, Yingjie
    Shuai, Lingying
    Ge, Fangzhen
    Chen, Debao
    VISUAL COMPUTER, 2025, 41 (04): : 2439 - 2459
  • [46] Hybrid Swin Transformer-Based Classification of Gaze Target Regions
    Wu, Gongpu
    Wang, Changyuan
    Gao, Lina
    Xue, Jinna
    IEEE ACCESS, 2023, 11 : 132055 - 132067
  • [47] Transformer-based sensor failure prediction and classification framework for UAVs
    Ahmad, Muhammad Waqas
    Akram, Muhammad Usman
    Mohsan, Mashood Mohammad
    Saghar, Kashif
    Ahmad, Rashid
    Butt, Wasi Haider
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 248
  • [48] Transformer-based unsupervised contrastive learning for histopathological image classification
    Wang, Xiyue
    Yang, Sen
    Zhang, Jun
    Wang, Minghui
    Zhang, Jing
    Yang, Wei
    Huang, Junzhou
    Han, Xiao
    MEDICAL IMAGE ANALYSIS, 2022, 81
  • [49] A Study on Performance Enhancement by Integrating Neural Topic Attention with Transformer-Based Language Model
    Um, Taehum
    Kim, Namhyoung
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [50] Comparative Analysis of Community Detection and Transformer-Based Approaches for Topic Clustering of Scientific Papers
    Bretsko, Daniel
    Belyi, Alexander
    Sobolevsky, Stanislav
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2023, PT I, 2023, 13956 : 648 - 660