Hindi Word Sense Disambiguation Using Cosine Similarity

Cited by: 2
Authors
Sarika, D. K. [1]
Sharma, Dilip Kumar [1]
Affiliations
[1] GLA Univ, Dept Comp Engn & Applicat, Mathura, India
Keywords
Word sense disambiguation; Natural language processing; Ambiguity; Hindi WordNet; Cosine similarity
DOI
10.1007/978-981-10-0135-2_76
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Hindi is the regional language of India. Most people access, retrieve, and share documents in Hindi. Like all natural languages, Hindi is ambiguous, which creates obstacles to the proper use of information technology. Resolving this ambiguity requires a Hindi word sense disambiguation (HWSD) system. In this paper, we present a supervised method, HWSD using cosine similarity, in which weighted vectors are created for the test query and for the sense knowledge data of the ambiguous word. An experiment on a dataset of 90 ambiguous Hindi words shows that this method outperforms Lesk's algorithm, a well-known algorithm for word sense disambiguation (WSD). We obtained an overall average precision of 78.99% and an average recall of 72.58%.
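The general idea in the abstract, scoring each candidate sense by the cosine similarity between a context vector and a sense vector, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the weighting scheme, so plain term counts are assumed here, and the sense inventory `SENSE_DATA`, the sense labels, and the function names are hypothetical.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-weight vectors (dict-like)."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def disambiguate(context_tokens, sense_data):
    """Return the best-scoring sense and the score of every sense.

    Assumption: weights are raw term counts; the paper may use another scheme.
    """
    query = Counter(context_tokens)
    scores = {
        sense: cosine_similarity(query, Counter(tokens))
        for sense, tokens in sense_data.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical sense knowledge for the ambiguous word "सोना" ("gold" / "to sleep").
SENSE_DATA = {
    "gold": ["धातु", "आभूषण", "कीमती", "पीला"],
    "sleep": ["नींद", "रात", "बिस्तर", "आराम"],
}

# Hypothetical test query, already tokenized.
context = ["रात", "को", "जल्दी", "सोना", "आराम", "देता", "है"]
best, scores = disambiguate(context, SENSE_DATA)
print(best, scores)
```

In this toy query, only the "sleep" sense shares tokens with the context, so it receives the only non-zero score. A realistic system would draw sense data from a resource such as Hindi WordNet and would likely remove stop words before building the vectors.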
Pages: 801-808
Page count: 8
Related papers
50 in total
  • [31] Applying a Naive Bayes Similarity Measure to Word Sense Disambiguation
    Wang, Tong
    Hirst, Graeme
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2014, : 531 - 537
  • [32] CHINESE QUESTION SIMILARITY CALCULATION BASED ON WORD SENSE DISAMBIGUATION
    Pang, Xiu-Ling
    Jia, Ke-Liang
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 2217 - +
  • [33] Word Sense Disambiguation with a Similarity-Smoothed Case Library
    Dekang Lin
    Computers and the Humanities, 2000, 34 : 147 - 152
  • [34] Word sense disambiguation with a similarity-smoothed case library
    Lin, D
    COMPUTERS AND THE HUMANITIES, 2000, 34 (1-2): : 147 - 152
  • [35] Unsupervised Word Sense Disambiguation Using Word Embeddings
    Moradi, Behzad
    Ansari, Ebrahim
    Zabokrtsky, Zdenek
    PROCEEDINGS OF THE 2019 25TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), 2019, : 228 - 233
  • [36] Unsupervised similarity-based word sense disambiguation using context vectors and sentential word importance
    Abdalgader, Khaled
    Skabar, Andrew
    ACM Transactions on Speech and Language Processing, 2012, 9 (01):
  • [37] Word Sense Disambiguation Using Clustered Sense Labels
    Park, Jeong Yeon
    Shin, Hyeong Jin
    Lee, Jae Sung
    APPLIED SCIENCES-BASEL, 2022, 12 (04):
  • [38] Arabic word sense disambiguation using sense inventories
    Alian M.
    Awajan A.
    International Journal of Information Technology, 2023, 15 (2) : 735 - 744
  • [39] Short-Text Similarity Measurement Using Word Sense Disambiguation and Synonym Expansion
    Abdalgader, Khaled
    Skabar, Andrew
    AI 2010: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2010, 6464 : 435 - 444
  • [40] Word Sense Disambiguation applied to Assamese-Hindi Bilingual Statistical Machine Translation
    Barman, Anup Kumar
    Sarmah, Jumi
    Basimatary, Subungshri
    Nag, Amitava
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2024, 14 (01) : 12581 - 12586