Sense disambiguation for Punjabi language using supervised machine learning techniques

Cited by: 2
Authors
Singh, Varinder Pal [1]
Kumar, Parteek [1]
Affiliations
[1] Thapar Inst Engn & Technol, Comp Sci & Engn Dept, Patiala 147004, Punjab, India
Keywords
Lexical features; syntactic features; word embedding; supervised learning techniques; word sense disambiguation
DOI
10.1007/s12046-019-1206-x
Chinese Library Classification
T [Industrial Technology]
Discipline classification code
08
Abstract
Word Sense Disambiguation (WSD) is the automatic identification of the meaning of a word in its context. It is a vital and hard artificial intelligence problem that arises in several natural language processing applications, such as machine translation, question answering and information retrieval. In this paper, an explicit WSD system for the Punjabi language using supervised techniques is analysed. A sense-tagged corpus of 150 ambiguous Punjabi nouns has been manually prepared. Six supervised machine learning techniques are investigated: Decision List, Decision Tree, Naive Bayes, K-Nearest Neighbour (K-NN), Random Forest and Support Vector Machines (SVM). Every classifier uses the same feature space, which comprises count-based lexical features (unigram, bigram, collocations and co-occurrence) and syntactic features (part of speech). Semantic features for Punjabi are derived from unlabelled Punjabi Wikipedia text using the word2vec continuous bag-of-words and skip-gram shallow neural network models. Two deep learning classifiers, multilayer perceptron and long short-term memory (LSTM), are also applied to WSD of Punjabi words. The word embedding features are evaluated with the six classifiers on the Punjabi WSD task. The performance of the supervised classifiers on this task improves when word embedding features are used. An accuracy of 84% is achieved by the LSTM classifier using word embedding features.
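To make the count-based lexical feature space concrete, the following is a minimal sketch, not the authors' implementation. It assumes a context window of two tokens, hypothetical feature names (`uni=`, `bi_left=`, `bi_right=`), English toy sentences in place of the Punjabi sense-tagged corpus, and a hand-rolled Naive Bayes classifier with add-one smoothing standing in for one of the six classifiers named in the abstract.

```python
import math
from collections import Counter, defaultdict


def context_features(tokens, target_index, window=2):
    """Count-based lexical features around the ambiguous word (assumed design)."""
    feats = Counter()
    lo = max(0, target_index - window)
    hi = min(len(tokens), target_index + window + 1)
    for i in range(lo, hi):
        if i != target_index:
            feats["uni=" + tokens[i]] += 1  # co-occurring unigram in the window
    target = tokens[target_index]
    if target_index > 0:  # bigram formed with the left neighbour
        feats["bi_left=" + tokens[target_index - 1] + "_" + target] += 1
    if target_index + 1 < len(tokens):  # bigram formed with the right neighbour
        feats["bi_right=" + target + "_" + tokens[target_index + 1]] += 1
    return feats


class NaiveBayesWSD:
    """Multinomial Naive Bayes over feature counts with add-alpha smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha
        self.sense_counts = Counter()
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()

    def fit(self, examples):
        for feats, sense in examples:
            self.sense_counts[sense] += 1
            for name, count in feats.items():
                self.feature_counts[sense][name] += count
                self.vocab.add(name)
        return self

    def predict(self, feats):
        total = sum(self.sense_counts.values())
        best_sense, best_score = None, -math.inf
        for sense, n in self.sense_counts.items():
            score = math.log(n / total)  # log prior of the sense
            denom = sum(self.feature_counts[sense].values()) + self.alpha * len(self.vocab)
            for name, count in feats.items():
                if name not in self.vocab:
                    continue  # ignore features never seen in training
                numer = self.feature_counts[sense][name] + self.alpha
                score += count * math.log(numer / denom)
            if score > best_score:
                best_sense, best_score = sense, score
        return best_sense


# Hypothetical toy data: (sentence, index of the ambiguous word, sense label).
TRAIN = [
    ("he sat on the bank of the river", 4, "river_side"),
    ("the river bank was muddy", 2, "river_side"),
    ("she deposited money in the bank today", 5, "finance"),
    ("the bank approved the loan", 1, "finance"),
]


def build_examples(rows):
    return [(context_features(s.split(), i), sense) for s, i, sense in rows]


if __name__ == "__main__":
    model = NaiveBayesWSD().fit(build_examples(TRAIN))
    query = context_features("they walked along the river bank".split(), 5)
    print(model.predict(query))
```

Part-of-speech and word embedding features, which the abstract also mentions, are left out of this sketch; they would be added as further entries in the same feature dictionary or as a dense vector per context.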
Pages: 15