Design and Implementation of Word2Vec Parallel Algorithm Based on HPC

被引:0
|
作者
Yi, Xianyong [1 ]
Zheng, Rongge [1 ]
Wang, Aoyu [1 ]
Qin, Hao [1 ]
Chen, Yufeng [1 ]
机构
[1] Shandong Univ, Sch Mech Elect & Informat Engn, Weihai, Weihai, Peoples R China
关键词
HPC; Word2Vec; Parallel Algorithm; Natural Language Processing;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Word2Vec, (Word to Vector) processes natural language by calculating the cosine similarity. However, the serial algorithm of original Word2Vec fails to satisfy the demands of training of corpus text because of the explosive growth of information. It has become the bottleneck owing to its comparatively low processing efficiency. The High Performance Computing (HPC) specializes in improving the calculation efficiency; therefore, the training efficiency of corpus texts can be greatly improved by parallelizing Word2Vec algorithm. After analyzing the characteristics of the Word2Vec algorithm in detail, we design and implement a parallel Word2Vec algorithm and use it to train corpus text on HPC. Furthermore, the corpus texts of different sizes are collected and trained, and the speed-up ratio is calculated by using the serial algorithm and parallel algorithm of Word2Vec, respectively. The experimental results show that there is a higher speed-up ratio when using the Word2Vec parallel algorithm running on HPC.
引用
收藏
页码:585 / 590
页数:6
相关论文
共 50 条
  • [21] Matching Transportation Ontologies with Word2Vec and Alignment Extraction Algorithm
    Xue, Xingsi
    Wang, Haolin
    Zhang, Jie
    Huang, Yikun
    Li, Mengting
    Zhu, Hai
    JOURNAL OF ADVANCED TRANSPORTATION, 2021, 2021
  • [22] Research on the Construction of Sentiment Dictionary Based on Word2vec
    Song, Xiao-yu
    Zhao, Yang
    Jin, Li-ting
    Sun, Yue
    Liu, Tong
    2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
  • [23] Construction Method of Sentiment Lexicon Based on Word2vec
    Yuan, Zhengwu
    Duan, Lian
    PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 848 - 851
  • [24] Encrypted Malicious Traffic Detection Based on Word2Vec
    Ferriyan, Andrey
    Thamrin, Achmad Husni
    Takeda, Keiji
    Murai, Jun
    ELECTRONICS, 2022, 11 (05)
  • [25] Research on Chinese Text Classification Based on Word2vec
    Yang, Zhi-Tong
    Zheng, Jun
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1166 - 1170
  • [26] A User Profile Modeling Method Based on Word2Vec
    Hu, Jianqiao
    Jin, Feng
    Zhang, Guigang
    Wang, Jian
    Yang, Yi
    2017 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C), 2017, : 410 - 414
  • [27] Microblogging Short Text Classification based on Word2Vec
    Zhang, Yonghui
    Liu, Jingang
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ELECTRONIC, MECHANICAL, INFORMATION AND MANAGEMENT SOCIETY (EMIM), 2016, 40 : 395 - 401
  • [28] Text Coverless Information Hiding Based on Word2vec
    Long, Yi
    Liu, Yuling
    CLOUD COMPUTING AND SECURITY, PT IV, 2018, 11066 : 463 - 472
  • [29] Short Text Classification Based on Wikipedia and Word2vec
    Liu Wensen
    Cao Zewen
    Wang Jun
    Wang Xiaoyi
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1195 - 1200
  • [30] Similarity Analysis of Law Documents Based on Word2vec
    Xia, Chunyu
    He, Tieke
    Li, Wenlong
    Qin, Zemin
    Zou, Zhipeng
    2019 COMPANION OF THE 19TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS-C 2019), 2019, : 354 - 357