Design and Implementation of Word2Vec Parallel Algorithm Based on HPC

被引:0
|
作者
Yi, Xianyong [1 ]
Zheng, Rongge [1 ]
Wang, Aoyu [1 ]
Qin, Hao [1 ]
Chen, Yufeng [1 ]
机构
[1] Shandong Univ, Sch Mech Elect & Informat Engn, Weihai, Weihai, Peoples R China
关键词
HPC; Word2Vec; Parallel Algorithm; Natural Language Processing;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Word2Vec, (Word to Vector) processes natural language by calculating the cosine similarity. However, the serial algorithm of original Word2Vec fails to satisfy the demands of training of corpus text because of the explosive growth of information. It has become the bottleneck owing to its comparatively low processing efficiency. The High Performance Computing (HPC) specializes in improving the calculation efficiency; therefore, the training efficiency of corpus texts can be greatly improved by parallelizing Word2Vec algorithm. After analyzing the characteristics of the Word2Vec algorithm in detail, we design and implement a parallel Word2Vec algorithm and use it to train corpus text on HPC. Furthermore, the corpus texts of different sizes are collected and trained, and the speed-up ratio is calculated by using the serial algorithm and parallel algorithm of Word2Vec, respectively. The experimental results show that there is a higher speed-up ratio when using the Word2Vec parallel algorithm running on HPC.
引用
收藏
页码:585 / 590
页数:6
相关论文
共 50 条
  • [1] Research and Implementation of Hybrid Recommendation Algorithm Based on Collaborative Filtering and Word2Vec
    Xiao, Yao
    Shi, Quan
    2015 8TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2015, : 172 - 175
  • [2] Chinese Text Summarization Algorithm Based on Word2vec
    Xu Chengzhang
    Liu Dan
    2018 INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING AND ARTIFICIAL INTELLIGENCE (CCEAI 2018), 2018, 976
  • [3] Movie Recommendation using Metadata based Word2Vec Algorithm
    Yoon, Yeo Chan
    Lee, Jun Woo
    2018 INTERNATIONAL CONFERENCE ON PLATFORM TECHNOLOGY AND SERVICE (PLATCON18), 2018, : 33 - 37
  • [4] Link Prediction Algorithm Based on Word2vec and Particle Swarm
    Jia C.-F.
    Han H.
    Lv Y.-N.
    Zhang L.
    Zidonghua Xuebao/Acta Automatica Sinica, 2020, 46 (08): : 1703 - 1713
  • [5] Word Semantic Similarity Calculation Based on Word2vec
    Jin, Xiaolin
    Zhang, Shuwu
    Liu, Jie
    2018 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS), 2018, : 12 - 16
  • [6] Word Clustering based on Word2vec and Semantic Similarity
    Luo Jie
    Wang Qinglin
    Li Yuan
    2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 517 - 521
  • [7] Study on Tibetan Word Vector based on Word2vec
    Yang, Ning
    Li, Guanyu
    Ding, Hailan
    Gong, Chunwei
    2018 INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS AND CONTROL ENGINEERING (ISPECE 2018), 2019, 1187
  • [8] A text retrieval algorithm based on the hybrid LDA and Word2Vec model
    Mu, Xue
    2019 INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION, BIG DATA & SMART CITY (ICITBS), 2019, : 373 - 376
  • [9] Research on application of article recommendation algorithm based on Word2Vec and Tfidf
    Wang, Rui
    Shi, Yuliang
    2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 454 - 457
  • [10] A News Recommendation Algorithm Based on Word2vec and Convolutional Neural Network
    Ding, Zhengqi
    Sun, Chang
    Sun, Gang
    Liu, Qihang
    Ma, Zhiyuan
    2022 THE 6TH INTERNATIONAL CONFERENCE ON VIRTUAL AND AUGMENTED REALITY SIMULATIONS, ICVARS 2022, 2022, : 96 - 100