Relational Turkish Text Classification Using Distant Supervised Entities and Relations

Cited by: 1
Authors
Okur, Halil Ibrahim [1, 2]
Tohma, Kadir [1]
Sertbas, Ahmet [2]
Affiliations
[1] Iskenderun Tech Univ, Fac Engn & Nat Sci, Dept Comp Engn, TR-31200 Hatay, Turkiye
[2] Istanbul Univ Cerrahpasa, Fac Engn, Dept Comp Engn, TR-34310 Istanbul, Turkiye
Source
CMC-COMPUTERS MATERIALS & CONTINUA | 2024 / Vol. 79 / Issue 02
Keywords
Text classification; relation extraction; NER; distant supervision; deep learning; machine learning; MODEL;
DOI
10.32604/cmc.2024.050585
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Text classification, by automatically categorizing texts, is one of the foundational elements of natural language processing applications. This study investigates how text classification performance can be improved through the integration of entity-relation information obtained from Wikidata (the Wikipedia database) and BERT-based pre-trained Named Entity Recognition (NER) models. Focusing on a significant challenge in the field of natural language processing (NLP), the research evaluates the potential of using entity and relational information to extract deeper meaning from texts. The adopted methodology encompasses a comprehensive approach that includes text preprocessing, entity detection, and the integration of relational information. Experiments conducted on text datasets in both Turkish and English assess the performance of various classification algorithms, such as Support Vector Machine, Logistic Regression, Deep Neural Network, and Convolutional Neural Network. The results indicate that the integration of entity-relation information can significantly enhance algorithm performance in text classification tasks and offer new perspectives for information extraction and semantic analysis in NLP applications. Contributions of this work include the utilization of distant supervised entity-relation information in Turkish text classification, the development of a Turkish relational text classification approach, and the creation of a relational database. By demonstrating potential performance improvements through the integration of distant supervised entity-relation information into Turkish text classification, this research aims to support the effectiveness of text-based artificial intelligence (AI) tools.
Additionally, it makes significant contributions to the development of multilingual text classification systems by adding deeper meaning to text content, thereby providing a valuable addition to current NLP studies and setting an important reference point for future research.
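The abstract names three steps (text preprocessing, entity detection, integration of relational information) without saying how the relational information is merged into the classifier input. One simple possibility, shown here only as a minimal sketch and not as the authors' method, is to append a pseudo-token for each entity-relation triple before building bag-of-words features. The knowledge-base entries, the `REL:` token format, and all function names below are hypothetical; a dictionary lookup stands in for both the BERT-based NER model and the Wikidata query.

```python
from collections import Counter

# Hypothetical toy knowledge base standing in for entity-relation triples
# retrieved from Wikidata; the entries are illustrative, not from the paper.
TOY_KB = {
    "galatasaray": [("instance_of", "football_club")],
    "ankara": [("capital_of", "turkiye")],
}


def detect_entities(tokens, kb):
    """Stand-in for an NER model: a token counts as an entity if the KB has it."""
    return [token for token in tokens if token in kb]


def augment_with_relations(text, kb=TOY_KB):
    """Lowercase and split the text, then append one pseudo-token per triple."""
    tokens = text.lower().split()
    extra = []
    for entity in detect_entities(tokens, kb):
        for relation, obj in kb[entity]:
            extra.append(f"REL:{relation}:{obj}")
    return tokens + extra


def bag_of_words(tokens):
    """Count token occurrences; the counts can feed any standard classifier."""
    return Counter(tokens)


features = bag_of_words(augment_with_relations("Galatasaray won the match"))
```

With this design the relational signal becomes ordinary features, so the classifiers listed in the abstract could consume it without architectural changes. Whether the paper encodes relations this way is not stated in the abstract.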
Pages: 2209-2228
Page count: 20