Towards Better Sentiment Analysis in the Turkish Language: Dataset Improvements and Model Innovations

被引:0
|
作者
Zumberoglu, Kevser Busra [1 ,2 ]
Dik, Sumeyye Zulal [1 ,2 ]
Karadeniz, Busra Sinem [1 ,2 ]
Sahmoud, Shaaban [1 ,2 ]
机构
[1] Fatih Sultan Mehmet Vakif Univ, Comp Engn Dept, TR-34015 Istanbul, Turkiye
[2] Fatih Sultan Mehmet Vakif Univ, Data Sci Applicat & Res Ctr VEBIM, TR-34015 Istanbul, Turkiye
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 04期
关键词
sentiment analysis; turkish language; turkish sentiment analysis; BERT;
D O I
10.3390/app15042062
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Sentiment analysis in the Turkish language has gained increasing attention due to the growing availability of Turkish textual data across various domains. However, existing datasets often suffer from limitations such as insufficient size, lack of diversity, and annotation inconsistencies, which hinder the development of robust and accurate sentiment analysis models. In this study, we present a novel enhanced dataset specifically designed to address these challenges, providing a comprehensive and high-quality resource for Turkish sentiment analysis. We perform a comparative evaluation of previously proposed models using our dataset to assess their performance and limitations. Experimental findings demonstrate the effectiveness of the presented dataset and trained models, offering valuable insights for advancing sentiment analysis research in the Turkish language. These results underscore the critical role of the enhanced dataset in bridging the gap between existing datasets and the importance of training the modern sentiment analysis models on scalable, balanced, and curated datasets. This can offer valuable insights for advancing sentiment analysis research in the Turkish language. Furthermore, the experimental results represent an important step in overcoming the challenges associated with Turkish sentiment analysis and improving the performance of existing models.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Twitter Dataset and Evaluation of Transformers for Turkish Sentiment Analysis
    Koksal, Abdullatif
    Ozgur, Arzucan
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [2] Preparation of Improved Turkish DataSet for Sentiment Analysis in Social Media
    Makinist, Semiha
    Hallac, Ibrahim Riza
    Karakus, Betul Ay
    Aydin, Galip
    2ND INTERNATIONAL CONFERENCE ON COMPUTATIONAL MATHEMATICS AND ENGINEERING SCIENCES (CMES2017), 2017, 13
  • [3] Improvements of Webometrics by Using Sentiment Analysis for Better Accessibility of the Web
    Malinsky, Radek
    Jelinek, Ivan
    CURRENT TRENDS IN WEB ENGINEERING, 2010, 6385s : 581 - 586
  • [4] Towards Language-independent Sentiment Analysis
    Abudawood, Tarek
    Alraqibah, Heelah
    Alsanie, Waleed
    2018 21ST SAUDI COMPUTER SOCIETY NATIONAL COMPUTER CONFERENCE (NCC), 2018,
  • [5] A Dataset and BERT-based Models for Targeted Sentiment Analysis on Turkish Texts
    Mutlu, M. Melih
    Ozgur, Arzucan
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): STUDENT RESEARCH WORKSHOP, 2022, : 467 - 472
  • [6] Human-annotated dataset for social media sentiment analysis for Albanian language
    Kadriu, Fatbardh
    Murtezaj, Doruntina
    Gashi, Fatbardh
    Ahmedi, Lule
    Kurti, Arianit
    Kastrati, Zenun
    DATA IN BRIEF, 2022, 43
  • [7] Producing An Instagram Dataset For Persian Language Sentiment Analysis Using Crowdsourcing Method
    Heidari, Mahsa
    Shamsinejad, Pirooz
    2020 6TH INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2020, : 284 - 287
  • [8] The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements
    Stappen, Lukas
    Baird, Alice
    Schumann, Lea
    Schuller, Bjoern
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) : 1334 - 1350
  • [9] Detecting Happiness in Italian Tweets: Towards an Evaluation Dataset for Sentiment Analysis in Felicitta
    Bosco, C.
    Allisio, L.
    Mussa, V.
    Patti, V.
    Ruffo, G.
    Sanguinetti, M.
    Sulis, E.
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [10] Sentiment Analysis in Turkish Sign Language Through Facial Expressions and Hand Gestures
    Takir, Seyma
    Bilen, Baris
    Arslan, Dogukan
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,