Towards Better Sentiment Analysis in the Turkish Language: Dataset Improvements and Model Innovations

被引:0
|
作者
Zumberoglu, Kevser Busra [1 ,2 ]
Dik, Sumeyye Zulal [1 ,2 ]
Karadeniz, Busra Sinem [1 ,2 ]
Sahmoud, Shaaban [1 ,2 ]
机构
[1] Fatih Sultan Mehmet Vakif Univ, Comp Engn Dept, TR-34015 Istanbul, Turkiye
[2] Fatih Sultan Mehmet Vakif Univ, Data Sci Applicat & Res Ctr VEBIM, TR-34015 Istanbul, Turkiye
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 04期
关键词
sentiment analysis; turkish language; turkish sentiment analysis; BERT;
D O I
10.3390/app15042062
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Sentiment analysis in the Turkish language has gained increasing attention due to the growing availability of Turkish textual data across various domains. However, existing datasets often suffer from limitations such as insufficient size, lack of diversity, and annotation inconsistencies, which hinder the development of robust and accurate sentiment analysis models. In this study, we present a novel enhanced dataset specifically designed to address these challenges, providing a comprehensive and high-quality resource for Turkish sentiment analysis. We perform a comparative evaluation of previously proposed models using our dataset to assess their performance and limitations. Experimental findings demonstrate the effectiveness of the presented dataset and trained models, offering valuable insights for advancing sentiment analysis research in the Turkish language. These results underscore the critical role of the enhanced dataset in bridging the gap between existing datasets and the importance of training the modern sentiment analysis models on scalable, balanced, and curated datasets. This can offer valuable insights for advancing sentiment analysis research in the Turkish language. Furthermore, the experimental results represent an important step in overcoming the challenges associated with Turkish sentiment analysis and improving the performance of existing models.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] Development of a Multilingual Model for Machine Sentiment Analysis in the Serbian Language
    Draskovic, Drazen
    Zecevic, Darinka
    Nikolic, Bosko
    MATHEMATICS, 2022, 10 (18)
  • [22] DeBERTa-GRU: Sentiment Analysis for Large Language Model
    Assiri, Adel
    Gumaei, Abdu
    Mehmood, Faisal
    Abbas, Touqeer
    Ullah, Sami
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (03): : 4219 - 4236
  • [23] Large language model for Bible sentiment analysis: Sermon on the Mount
    Vora, Mahek
    Blau, Tom
    Kachhwal, Vansh
    Solo, Ashu M.G.
    Chandra, Rohitash
    arXiv, 2024,
  • [24] Bidirectional transfer learning model for sentiment analysis of natural language
    Malhotra, Shivani
    Kumar, Vinay
    Agarwal, Alpana
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (11) : 10267 - 10287
  • [25] Natural Language Processing for the Analysis Sentiment using a LSTM Model
    Berrajaa, Achraf
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (05) : 777 - 785
  • [26] Towards Cross-Language Sentiment Analysis through Universal Star Ratings
    Hogenboom, Alexander
    Bal, Malissa
    Frasincar, Flavius
    Bal, Daniella
    7TH INTERNATIONAL CONFERENCE ON KNOWLEDGE MANAGEMENT IN ORGANIZATIONS: SERVICE AND CLOUD COMPUTING, 2013, 172 : 69 - 79
  • [27] Towards better decoding and language model integration in sequence to sequence models
    Chorowski, Jan
    Jaitly, Navdeep
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 523 - 527
  • [28] Application of JSA, a Sentiment Analysis Model in Chinese Language and Literature, to Sentiment Recognition in Classical Poetry
    Li, Yali
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [29] DravidianCodeMix: sentiment analysis and offensive language identification dataset for Dravidian languages in code-mixed text
    Chakravarthi, Bharathi Raja
    Priyadharshini, Ruba
    Muralidaran, Vigneshwaran
    Jose, Navya
    Suryawanshi, Shardul
    Sherly, Elizabeth
    McCrae, John P.
    LANGUAGE RESOURCES AND EVALUATION, 2022, 56 (03) : 765 - 806
  • [30] DravidianCodeMix: sentiment analysis and offensive language identification dataset for Dravidian languages in code-mixed text
    Bharathi Raja Chakravarthi
    Ruba Priyadharshini
    Vigneshwaran Muralidaran
    Navya Jose
    Shardul Suryawanshi
    Elizabeth Sherly
    John P. McCrae
    Language Resources and Evaluation, 2022, 56 : 765 - 806