Name Indexing in Indonesian Translation of Hadith using Named Entity Recognition with Na ve Bayes Classifier

被引:4
|
作者
Azalia, Fadhila Yasmine [1 ]
Bijaksana, Moch Arif [1 ]
Huda, Arief Fatchul [2 ]
机构
[1] Telkom Univ, Sch Comp, Bandung 40257, Indonesia
[2] UIN Sunan Gunung Jati, Fac Sci & Technol, Bandung 40614, Indonesia
关键词
Named Entity Recognition; Naive Bayes Classifier; Index; Hadith;
D O I
10.1016/j.procs.2019.08.151
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hadith is believed to be the main source of Islam after Qur'an. The simplicity of obtaining hadith information is currently supported by global access using the internet. The abundance of hadith literature sometimes finds difficulties to obtain the information that needed. Therefore, information extraction is required to facilitate the searching of information in hadith. In this study, the name indexing in Indonesian translation of hadith from nine narrators was built. The model was built using Named Entity Recognition with Naive Bayes classifier. The features used in this study are title case, POS tag and unigram. This study experimented with individual features and features that were combined. Precision, recall, and F1-Score are employed as evaluation metrics. F1-Score is used in this study to measure the performance of named entity and features. The results of experiments extracted 258 people's names from 13870 token data from 100 Indonesian hadith texts and show that implementing the combination of all features can achieve 82.63% of F1-Score. (C) 2019 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/) Peer-review under responsibility of the scientific committee of the 4th International Conference on Computer Science and Computational Intelligence 2019.
引用
收藏
页码:142 / 149
页数:8
相关论文
共 47 条
  • [1] Chemical named entity recognition in the texts of scientific publications using the naïve Bayes classifier approach
    O. A. Tarasova
    A. V. Rudik
    N. Yu. Biziukova
    D. A. Filimonov
    V. V. Poroikov
    Journal of Cheminformatics, 14
  • [2] Narrator's Name Recognition with Support Vector Machine for Indexing Indonesian Hadith translations
    Yusup, Fajar Achmad
    Bijaksana, Moch Arif
    Huda, Arief Fatchul
    4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2019) : ENABLING COLLABORATION TO ESCALATE IMPACT OF RESEARCH RESULTS FOR SOCIETY, 2019, 157 : 191 - 198
  • [3] Chemical named entity recognition in the texts of scientific publications using the naive Bayes classifier approach
    Tarasova, O. A.
    Rudik, A., V
    Biziukova, N. Yu
    Filimonov, D. A.
    Poroikov, V. V.
    JOURNAL OF CHEMINFORMATICS, 2022, 14 (01)
  • [4] Indexing Name in Hadith Translation Using Hidden Markov Model (HMM)
    Sari, Widia Permata
    Bijaksana, Moch Arif
    Huda, Arief Fatchul
    2019 7TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2019, : 552 - 556
  • [5] Bengali Named Entity Recognition using Classifier Combination
    Ekbal, Asif
    Bandyopadhyay, Sivaji
    ICAPR 2009: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, PROCEEDINGS, 2009, : 259 - 262
  • [6] Classifier Ensemble using Multiobjective Optimization for Named Entity Recognition
    Ekbal, Asif
    Saha, Sriparna
    ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 783 - 788
  • [7] Boosting drug named entity recognition using an aggregate classifier
    Korkontzelos, Ioannis
    Piliouras, Dimitrios
    Dowsey, Andrew W.
    Ananiadou, Sophia
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2015, 65 (02) : 145 - 153
  • [8] Indonesian Protected Health Information Removal using Named Entity Recognition
    Al-Ash, Herley Shaori
    Fanany, Ivan
    Bustamam, Alhadi
    PROCEEDINGS OF 2019 12TH INTERNATIONAL CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGY AND SYSTEM (ICTS), 2019, : 258 - 263
  • [9] Named Entity Recognition on Indonesian Tweets using Hidden Markov Model
    Azarine, Indira Suri
    Bijaksana, Moch Arif
    Asror, Ibnu
    2019 7TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2019, : 547 - 551
  • [10] Medical Named Entity Recognition for Indonesian Language Using Word Representations
    Rahman, Arief
    INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND DIGITAL APPLICATIONS (ICITDA 2017), 2018, 325