Sense disambiguation for Punjabi language using supervised machine learning techniques

Authors
Varinder Pal Singh
Parteek Kumar
Affiliation
[1] Thapar Institute of Engineering and Technology, Computer Science and Engineering Department
Source
Sādhanā | 2019 / Vol. 44
Keywords
Lexical features; syntactic features; word embedding; supervised learning techniques; word sense disambiguation
Abstract
Word Sense Disambiguation (WSD) is the automatic identification of the meaning of a word in its context. It is a vital and hard artificial intelligence problem that arises in several natural language processing applications, such as machine translation, question answering and information retrieval. In this paper, an explicit WSD system for the Punjabi language using supervised techniques is analysed. A sense-tagged corpus of 150 ambiguous Punjabi nouns has been prepared manually. Six supervised machine learning techniques are investigated: Decision List, Decision Tree, Naive Bayes, K-Nearest Neighbour (K-NN), Random Forest and Support Vector Machines (SVM). Every classifier uses the same feature space, comprising count-based lexical features (unigram, bigram, collocation and co-occurrence) and syntactic features (part of speech). Semantic features of the Punjabi language are derived from unlabelled Punjabi Wikipedia text using the word2vec continuous bag-of-words and skip-gram shallow neural network models. Two deep learning neural network classifiers, multilayer perceptron and long short-term memory (LSTM), are also applied to WSD of Punjabi words. The word embedding features are also tested with the six classifiers on the Punjabi WSD task. The performance of the supervised classifiers on Punjabi WSD improves when word embedding features are used. An accuracy of 84% is achieved by the LSTM classifier using word embedding features.
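To make the count-based setup concrete, here is a minimal sketch of one of the six classifiers named in the abstract, Naive Bayes, trained on unigram co-occurrence and collocation features taken from a window around the ambiguous word. It is not the authors' implementation: the function and class names (extract_features, NaiveBayesWSD), the window size, the add-one smoothing and the English toy sentences for the word "bank" are all assumptions chosen for illustration, standing in for the sense-tagged Punjabi corpus.

```python
import math
from collections import Counter, defaultdict


def extract_features(tokens, target_index, window=2):
    """Count-based lexical features around the ambiguous word (illustrative).

    Co-occurrence features are the words within `window` positions of the
    target; collocation features are the immediate left and right neighbours.
    """
    features = []
    lo = max(0, target_index - window)
    hi = min(len(tokens), target_index + window + 1)
    for i in range(lo, hi):
        if i != target_index:
            features.append("w=" + tokens[i])
    if target_index > 0:
        features.append("left=" + tokens[target_index - 1])
    if target_index + 1 < len(tokens):
        features.append("right=" + tokens[target_index + 1])
    return features


class NaiveBayesWSD:
    """Multinomial Naive Bayes over feature counts with add-alpha smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha
        self.sense_counts = Counter()
        self.feature_counts = defaultdict(Counter)
        self.vocabulary = set()

    def fit(self, examples):
        # examples: iterable of (feature list, sense label) pairs
        for features, sense in examples:
            self.sense_counts[sense] += 1
            for f in features:
                self.feature_counts[sense][f] += 1
                self.vocabulary.add(f)
        return self

    def predict(self, features):
        total = sum(self.sense_counts.values())
        vocab_size = len(self.vocabulary)
        best_sense, best_score = None, -math.inf
        for sense, count in self.sense_counts.items():
            # log prior plus the sum of smoothed log likelihoods
            score = math.log(count / total)
            denom = sum(self.feature_counts[sense].values()) + self.alpha * vocab_size
            for f in features:
                score += math.log((self.feature_counts[sense][f] + self.alpha) / denom)
            if score > best_score:
                best_sense, best_score = sense, score
        return best_sense


# Toy sense-tagged data (hypothetical): (tokens, index of ambiguous word, sense)
train = [
    ("he deposited money in the bank yesterday".split(), 5, "finance"),
    ("the bank approved the loan".split(), 1, "finance"),
    ("they sat on the bank of the river".split(), 4, "river"),
    ("the river bank was muddy".split(), 2, "river"),
]
model = NaiveBayesWSD().fit([(extract_features(t, i), s) for t, i, s in train])

test_tokens = "she walked along the river bank".split()
prediction = model.predict(extract_features(test_tokens, 5))
print(prediction)
```

In this sketch the same feature lists could be passed to any of the other count-based classifiers, which mirrors the abstract's point that every classifier shares one feature space. Part-of-speech and word embedding features are left out here because they would need a tagger and a trained word2vec model.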