Distributional Semantic Model Based on Convolutional Neural Network for Arabic Textual Similarity

被引:3
|
作者
Mahmoud, Adnen [1 ]
Zrigui, Mounir [2 ]
机构
[1] Higher Inst Comp Sci & Commun Tech, Monastir, Tunisia
[2] Fac Sci Monastir, Monastir, Tunisia
关键词
Arabic Language; Context Based Approach; Global Vectors Representation; Natural Language Processing; Paraphrase Detection; Semantic Similarity; Word Embedding; Word2vec;
D O I
10.4018/IJCINI.2020010103
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem addressed is to develop a model that can reliably identify whether a previously unseen document pair is paraphrased or not. Its detection in Arabic documents is a challenge because of its variability in features and the lack of publicly available corpora. Faced with these problems, the authors propose a semantic approach. At the feature extraction level, the authors use global vectors representation combining global co-occurrence counting and a contextual skip gram model. At the paraphrase identification level, the authors apply a convolutional neural network model to learn more contextual and semantic information between documents. For experiments, the authors use Open Source Arabic Corpora as a source corpus. Then the authors collect different datasets to create a vocabulary model. For the paraphrased corpus construction, the authors replace each word from the source corpus by its most similar one which has the same grammatical class applying the word2vec algorithm and the part-of-speech annotation. Experiments show that the model achieves promising results in terms of precision and recall compared to existing approaches in the literature.
引用
收藏
页码:35 / 50
页数:16
相关论文
共 50 条
  • [11] Semantic Fire Segmentation Model Based on Convolutional Neural Network for Outdoor Image
    Choi, Han-Soo
    Jeon, Myeongho
    Song, Kyungmin
    Kang, Myungjoo
    FIRE TECHNOLOGY, 2021, 57 (06) : 3005 - 3019
  • [12] Asymmetric Parallel Semantic Segmentation Model Based on Full Convolutional Neural Network
    Li B.-Q.
    He Y.-Y.
    He L.-J.
    Qiang W.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (05): : 1058 - 1064
  • [13] Semantic Fire Segmentation Model Based on Convolutional Neural Network for Outdoor Image
    Han-Soo Choi
    Myeongho Jeon
    Kyungmin Song
    Myungjoo Kang
    Fire Technology, 2021, 57 : 3005 - 3019
  • [14] Ontology semantic integration based on convolutional neural network
    Yang Feng
    Lidan Fan
    Neural Computing and Applications, 2019, 31 : 8253 - 8266
  • [15] Ontology semantic integration based on convolutional neural network
    Feng, Yang
    Fan, Lidan
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (12): : 8253 - 8266
  • [16] A Semantic Textual Similarity Calculation Model Based on Pre-training Model
    Ding, Zhaoyun
    Liu, Kai
    Wang, Wenhao
    Liu, Bin
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2021, PT II, 2021, 12816 : 3 - 15
  • [17] Semantic Textual Similarity Justification Based on Multi-Model Ensemble
    Su, Jindian
    Hong, Xiaobin
    Yu, Shanshan
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2022, 50 (04): : 1 - 9
  • [18] Predicting Semantic Textual Similarity of Arabic Question Pairs using Deep Learning
    Einea, Omar
    Elnagar, Ashraf
    2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,
  • [19] Semantic textual similarity for modern standard and dialectal Arabic using transfer learning
    Sulaiman, Mansour Al
    Moussa, Abdullah M.
    Abdou, Sherif
    Elgibreen, Hebah
    Faisal, Mohammed
    Rashwan, Mohsen
    PLOS ONE, 2022, 17 (08):
  • [20] A recognition model for handwritten Persian/Arabic numbers based on optimized deep convolutional neural network
    Ali, Saqib
    Sahiba, Sana
    Azeem, Muhammad
    Shaukat, Zeeshan
    Mahmood, Tariq
    Sakhawat, Zareen
    Aslam, Muhammad Saqlain
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (10) : 14557 - 14580