Interpretable semantic textual similarity of sentences using alignment of chunks with classification and regression

被引:0
|
作者
Goutam Majumder
Partha Pakray
Ranjita Das
David Pinto
机构
[1] Lovely Professional University,
[2] National Institute of Technology Silchar,undefined
[3] National Institute of Technology Mizoram,undefined
[4] Benemérita Universidad Autónoma de Puebla,undefined
来源
Applied Intelligence | 2021年 / 51卷
关键词
Semantic textual similarity; Natural language understanding; Text classification; Multivariate regression;
D O I
暂无
中图分类号
学科分类号
摘要
The proposed work is focused on establishing an interpretable Semantic Textual Similarity (iSTS) method for a pair of sentences, which can clarify why two sentences are completely or partially similar or have some variations. This proposed interpretable approach is a pipeline of five modules that begins with the pre-processing and chunking of text. Further chunks of two sentences are aligned using a one–to–multi (1:M) chunk aligner. Thereafter, support vector, Gaussian Naive Bayes and k–Nearest Neighbours classifiers are then used to create a multiclass classification algorithm, and different class labels are used to define an alignment type. At last, a multivariate regression algorithm is developed to find the semantic equivalence of an alignment with a score (that ranges from 0 to 5). The efficiency of the proposed method is verified on three different datasets and also compared to other state–of–the–art interpretable STS (iSTS) methods. The evaluated results show that the proposed method performs better than other iSTS methods. Most importantly, the modules of the proposed iSTS method are used to develop a Textual Entailment (TE) method. It is found that, when we combined chunk level, alignment, and sentence level features the entailment results significantly improves.
引用
收藏
页码:7322 / 7349
页数:27
相关论文
共 50 条
  • [31] netDx: interpretable patient classification using integrated patient similarity networks
    Pai, Shraddha
    Hui, Shirley
    Isserlin, Ruth
    Shah, Muhammad A.
    Kaka, Hussam
    Bader, Gary D.
    MOLECULAR SYSTEMS BIOLOGY, 2019, 15 (03)
  • [32] Predicting Semantic Textual Similarity of Arabic Question Pairs using Deep Learning
    Einea, Omar
    Elnagar, Ashraf
    2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,
  • [33] Semantic textual similarity for modern standard and dialectal Arabic using transfer learning
    Sulaiman, Mansour Al
    Moussa, Abdullah M.
    Abdou, Sherif
    Elgibreen, Hebah
    Faisal, Mohammed
    Rashwan, Mohsen
    PLOS ONE, 2022, 17 (08):
  • [34] Evaluation of semantic similarity using vector space model based on textual corpus
    Hssina, Badr
    Bouikhalene, Belaid
    Merbouha, Abdelkrim
    2016 13TH INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS, IMAGING AND VISUALIZATION (CGIV), 2016, : 295 - 300
  • [35] Cross-Lingual Semantic Textual Similarity Modeling Using Neural Networks
    Li, Xia
    Chen, Minping
    Zeng, Zihang
    MACHINE TRANSLATION, CWMT 2018, 2019, 954 : 52 - 62
  • [36] Evaluating Question generation models using QA systems and Semantic Textual Similarity
    Shaheer, Safwan
    Hossain, Ishmam
    Sarna, Sudipta Nandi
    Mehedi, Md Humaion Kabir
    Rasel, Annajiat Alim
    2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 431 - 435
  • [37] Emotion Recognition for Sentences with Unknown Expressions based on Semantic Similarity by Using Bag of Concepts
    Matsumoto, Kazuyuki
    Yoshida, Minoru
    Xiao, Qingmei
    Luo, Xin
    Kita, Kenji
    2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 1394 - 1399
  • [38] Genetic algorithm based feature selection and parameter optimization for support vector regression applied to semantic textual similarity
    Su B.-H.
    Wang Y.-L.
    Journal of Shanghai Jiaotong University (Science), 2015, 20 (02) : 143 - 148
  • [39] Semantic Similarity Analysis for Examination Questions Classification Using WordNet
    Goh, Thing Thing
    Jamaludin, Nor Azliana Akmal
    Mohamed, Hassan
    Ismail, Mohd Nazri
    Chua, Huangshen
    APPLIED SCIENCES-BASEL, 2023, 13 (14):
  • [40] Genetic Algorithm Based Feature Selection and Parameter Optimization for Support Vector Regression Applied to Semantic Textual Similarity
    苏柏桦
    王英林
    JournalofShanghaiJiaotongUniversity(Science), 2015, 20 (02) : 143 - 148