Relational Turkish Text Classification Using Distant Supervised Entities and Relations

被引:1
|
作者
Okur, Halil Ibrahim [1 ,2 ]
Tohma, Kadir [1 ]
Sertbas, Ahmet [2 ]
机构
[1] Iskenderun Tech Univ, Fac Engn & Nat Sci, Dept Comp Engn, TR-31200 Hatay, Turkiye
[2] Istanbul Univ Cerrahpasa, Fac Engn, Dept Comp Engn, TR-34310 Istanbul, Turkiye
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 79卷 / 02期
关键词
Text classification; relation extraction; NER; distant supervision; deep learning; machine learning; MODEL;
D O I
10.32604/cmc.2024.050585
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text classification, by automatically categorizing texts, is one of the foundational elements of natural language processing applications. This study investigates how text classification performance can be improved through the integration of entity-relation information obtained from the Wikidata (Wikipedia database) database and BERTbased pre-trained Named Entity Recognition (NER) models. Focusing on a significant challenge in the field of natural language processing (NLP), the research evaluates the potential of using entity and relational information to extract deeper meaning from texts. The adopted methodology encompasses a comprehensive approach that includes text preprocessing, entity detection, and the integration of relational information. Experiments conducted on text datasets in both Turkish and English assess the performance of various classification algorithms, such as Support Vector Machine, Logistic Regression, Deep Neural Network, and Convolutional Neural Network. The results indicate that the integration of entity-relation information can significantly enhance algorithm performance in text classification tasks and offer new perspectives for information extraction and semantic analysis in NLP applications. Contributions of this work include the utilization of distant supervised entity-relation information in Turkish text classification, the development of a Turkish relational text classification approach, and the creation of a relational database. By demonstrating potential performance improvements through the integration of distant supervised entity-relation information into Turkish text classification, this research aims to support the effectiveness of text-based artificial intelligence (AI) tools. Additionally, it makes significant contributions to the development of multilingual text classification systems by adding deeper meaning to text content, thereby providing a valuable addition to current NLP studies and setting an important reference point for future research.
引用
收藏
页码:2209 / 2228
页数:20
相关论文
共 50 条
  • [21] A neural model for type classification of entities for text
    Li, Qi
    Dong, JunQi
    Zhong, Jiang
    Li, Qing
    Wang, Chen
    KNOWLEDGE-BASED SYSTEMS, 2019, 176 : 122 - 132
  • [22] Jointly Extract Entities and Their Relations From Biomedical Text
    Chen, Jizhi
    Gu, Junzhong
    IEEE ACCESS, 2019, 7 : 162818 - 162827
  • [23] The Importance of preprocessing in Turkish Text Classification
    Acikalin, Buse
    Bayazit, Nilgun Guler
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 2053 - 2056
  • [24] Active Learning for Turkish Text Classification
    Sapci, Ali Osman Berk
    Tastan, Oznur
    Yeniterzi, Reyyan
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [25] A supervised clustering method for text classification
    Pappuswamy, U
    Bhembe, D
    Jordan, PW
    VanLehn, K
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 704 - 714
  • [26] Text Classification Using Document-Relational Graph Convolutional Networks
    Liu, Chongyi
    Wang, Xiangyu
    Xu, Honglei
    IEEE ACCESS, 2022, 10 : 123205 - 123211
  • [27] Text segmentation on multilabel documents: A distant-supervised approach
    Manchanda, Saurav
    Karypis, George
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1170 - 1175
  • [28] Arabic Text Classification of News Articles Using Classical Supervised Classifiers
    Al Qadi, Leen
    El Rifai, Hozayfa
    Obaid, Safa
    Elnagar, Ashraf
    2019 2ND INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2019, : 238 - 243
  • [29] Learning to Extract Relations for Relational Classification
    Rendle, Steffen
    Preisach, Christine
    Schmidt-Thieme, Lars
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 1062 - 1071
  • [30] SISC: A Text Classification Approach Using Semi Supervised Subspace Clustering
    Ahmed, Mohammad Salim
    Khan, Latifur
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 1 - 6