Relational Turkish Text Classification Using Distant Supervised Entities and Relations

被引:1
|
作者
Okur, Halil Ibrahim [1 ,2 ]
Tohma, Kadir [1 ]
Sertbas, Ahmet [2 ]
机构
[1] Iskenderun Tech Univ, Fac Engn & Nat Sci, Dept Comp Engn, TR-31200 Hatay, Turkiye
[2] Istanbul Univ Cerrahpasa, Fac Engn, Dept Comp Engn, TR-34310 Istanbul, Turkiye
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 79卷 / 02期
关键词
Text classification; relation extraction; NER; distant supervision; deep learning; machine learning; MODEL;
D O I
10.32604/cmc.2024.050585
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text classification, by automatically categorizing texts, is one of the foundational elements of natural language processing applications. This study investigates how text classification performance can be improved through the integration of entity-relation information obtained from the Wikidata (Wikipedia database) database and BERTbased pre-trained Named Entity Recognition (NER) models. Focusing on a significant challenge in the field of natural language processing (NLP), the research evaluates the potential of using entity and relational information to extract deeper meaning from texts. The adopted methodology encompasses a comprehensive approach that includes text preprocessing, entity detection, and the integration of relational information. Experiments conducted on text datasets in both Turkish and English assess the performance of various classification algorithms, such as Support Vector Machine, Logistic Regression, Deep Neural Network, and Convolutional Neural Network. The results indicate that the integration of entity-relation information can significantly enhance algorithm performance in text classification tasks and offer new perspectives for information extraction and semantic analysis in NLP applications. Contributions of this work include the utilization of distant supervised entity-relation information in Turkish text classification, the development of a Turkish relational text classification approach, and the creation of a relational database. By demonstrating potential performance improvements through the integration of distant supervised entity-relation information into Turkish text classification, this research aims to support the effectiveness of text-based artificial intelligence (AI) tools. Additionally, it makes significant contributions to the development of multilingual text classification systems by adding deeper meaning to text content, thereby providing a valuable addition to current NLP studies and setting an important reference point for future research.
引用
收藏
页码:2209 / 2228
页数:20
相关论文
共 50 条
  • [1] Relational Turkish text classification with graph neural networks
    Okur, Halil Ibrahim
    Tohma, Kadir
    Sertbas, Ahmet
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2024,
  • [2] Document Embedding based Supervised Methods for Turkish Text Classification
    Celenli, Halil I.
    Ozturk, S. Talha
    Sahin, Gurkan
    Gerek, Aydin
    Ganiz, Murat C.
    2018 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2018, : 477 - 482
  • [3] Entities and relations for historical relational databases
    Morris, RA
    Khatib, L
    FOURTH INTERNATIONAL WORKSHOP ON TEMPORAL REPRESENTATION AND REASONING, PROCEEDINGS, 1997, : 180 - 186
  • [4] Jointly Embedding Entities and Text with Distant Supervision
    Newman-Griffis, Denis
    Lai, Albert M.
    Fosler-Lussier, Eric
    REPRESENTATION LEARNING FOR NLP, 2018, : 195 - 206
  • [5] Turkish Medical Text Classification Using BERT
    Celikten, Azer
    Bulut, Hasan
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [6] ESRE: handling repeated entities in distant supervised relation extraction
    Xin Sun
    Jinghu Jiang
    Yuming Shang
    Neural Computing and Applications, 2021, 33 : 11325 - 11337
  • [7] ESRE: handling repeated entities in distant supervised relation extraction
    Sun, Xin
    Jiang, Jinghu
    Shang, Yuming
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (17): : 11325 - 11337
  • [8] Segmenting handwritten text using supervised classification techniques
    Sun, Y
    Butler, TS
    Shafarenko, A
    Adams, R
    Loomes, M
    Davey, N
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 657 - 662
  • [9] Text Classification Using Semi-Supervised Clustering
    Zhang, Wen
    Yoshida, Taketoshi
    Tang, Xijin
    2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 197 - 200
  • [10] Recognizing Sentiment of Relations between Entities in Text
    Wang, Jun
    Ren, Fuji
    Li, Lei
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2014, 9 (06) : 614 - 620