Integrating semantic similarity with Dirichlet multinomial mixture model for enhanced web service clustering

被引:0
|
作者
Neha Agarwal
Geeta Sikka
Lalit Kumar Awasthi
机构
[1] Indian Institute of Information Technology Raichur,Computer Science and Engineering
[2] National Institute of Technology Delhi,Computer Science and Engineering
[3] National Institute of Technology Uttarakhand,undefined
来源
关键词
Web service clustering; Word embedding techniques; Dirichlet multinomial mixture model (DMM); Topic modeling techniques; Clustering techniques;
D O I
暂无
中图分类号
学科分类号
摘要
With accelerated advancement of web 2.0, developers generally describe the functionality of services in short natural text. Keyword-based searching techniques are not an efficient way of discovering services from repositories. It suffers from vocabulary problems. Latent Dirichlet allocation (LDA) with word embedding techniques is widely adopted for efficiently extracting latent features from the service descriptions. However, LDA is not efficient on short text due to limited content and inadequate occurring words. The word vectors generated by word embedding techniques are of finer quality than topic modeling techniques. Gibbs sampling algorithm for Dirichlet multinomial mixture (GSDMM) model gives better results on web service description documents because it provides one topic corresponding to one document. In this paper, we evaluate the performance of GSDMM model with word embeddings and propose WV+GSDMMK model. The proposed model improves service-to-topic mapping by determining semantic similarity among features. K-means clustering is applied on service to topic representation. Results are evaluated on five real-time datasets based on intrinsic and extrinsic evaluation measures. Experimental results demonstrate that the proposed method outperforms other baseline techniques, and the accuracy score is also increased by 5%, 18%, 3%, 4%, and 6% on datasets DS1, DS2, DS3, DS4, and DS5, respectively.
引用
收藏
页码:2327 / 2353
页数:26
相关论文
共 50 条
  • [1] Integrating semantic similarity with Dirichlet multinomial mixture model for enhanced web service clustering
    Agarwal, Neha
    Sikka, Geeta
    Awasthi, Lalit Kumar
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (04) : 2327 - 2353
  • [2] A Web service clustering method based on topic enhanced Gibbs sampling algorithm for the Dirichlet Multinomial Mixture model and service collaboration graph
    Hu, Qiang
    Shen, Jiaji
    Wang, Kun
    Du, Junwei
    Du, Yuyue
    INFORMATION SCIENCES, 2022, 586 : 239 - 260
  • [3] Evaluation of web service clustering using Dirichlet Multinomial Mixture model based approach for Dimensionality Reduction in service representation
    Agarwal, Neha
    Sikka, Geeta
    Awasthi, Lalit Kumar
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (04)
  • [4] An Adaptive Dirichlet Multinomial Mixture Model for Short Text Streaming Clustering
    Duan, Ruting
    Li, Chunping
    2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 49 - 55
  • [5] Tensor Dirichlet Process Multinomial Mixture Model with Graphs for Passenger Trajectory Clustering
    Li, Ziyue
    Yan, Hao
    Zhang, Chen
    Ketter, Wolfgang
    Tsung, Fugee
    PROCEEDINGS OF THE 6TH ACM SIGSPATIAL INTERNATIONAL WORKSHOP ON AI FOR GEOGRAPHIC KNOWLEDGE DISCOVERY, GEOAI 2023, 2023, : 121 - 128
  • [6] A Dirichlet Multinomial Mixture Model-based Approach for Short Text Clustering
    Yin, Jianhua
    Wang, Jianyong
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 233 - 242
  • [7] ChromDMM: a Dirichlet-multinomial mixture model for clustering heterogeneous epigenetic data
    Osmala, Maria
    Eraslan, Gokcen
    Lahdesmaki, Harri
    BIOINFORMATICS, 2022, 38 (16) : 3863 - 3870
  • [8] A Semantic Based Dirichlet Compound Multinomial Model
    Cerchiello, Paola
    Bonafede, Elvio Concetto
    NEW PERSPECTIVES IN STATISTICAL MODELING AND DATA ANALYSIS, 2011, : 525 - 533
  • [9] Railway Fault Text Clustering Method Using an Improved Dirichlet Multinomial Mixture Model
    Yang, Ni
    Zhang, Youpeng
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [10] Web service discovery among large service pools utilising semantic similarity and clustering
    Chen, Fuzan
    Li, Minqiang
    Wu, Harris
    Xie, Lingli
    ENTERPRISE INFORMATION SYSTEMS, 2017, 11 (03) : 452 - 469