Topic recommendation using Doc2Vec

被引:0
|
作者
Karvelis, Petros [1 ]
Gavrilis, Dimitris [2 ]
Georgoulas, George [3 ]
Stylios, Chrysostomos [1 ]
机构
[1] Technol Educ Inst Epirus, Lab Knowledge & Intelligent Comp, Dept Comp Engn, Arta, Greece
[2] Univ Patras, Dept Elect Engn & Comp Technol, Patras, Greece
[3] Lulea Univ Technol, Control Engn Grp, Dept Comp Sci Elect & Space Engn, Lulea, Sweden
关键词
Recommender system; multilabel classification; word2vec; doc2vec; bag of words;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ever-increasing number of electronic content stored in digital libraries requires a significant amount of effort in cataloguing and has led to self-deposit solutions where the authors submit and publish their own digital records. Even in self-deposit, going through the abstract and assigning subject terms or keywords is a time consuming and expensive process, yet crucial for the metadata quality of the record that affects retrieval. Therefore, an automatic, or even a semi-automatic process that can recommend topics for a new entry is of huge practical value. A system that can address that has to rely basically on two components, one component for efficiently representing the relevant information of the new document and one component for recommending an appropriate set of topics based on the representation of the previous stage. In this work, different candidate solutions for both components are investigated and compared. For the first stage both distributed Document to Vector (doc2vec) and conventional Bag of Words (BoW) components are employed, while for the latter two different transformation approaches from the field of multi-label classification are compared. For the comparison, a collection of Ph.D. abstracts (similar to 19000 documents) from the MIT Libraries Dspace repository is used suggesting that different combinations can provide high quality solutions.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Bangla news recommendation using doc2vec
    Nandi, Rabindra Nath
    Zaman, M. M. Arefin
    Al Muntasir, Tareq
    Sumit, Sakhawat Hosain
    Sourov, Tanvir
    Rahman, Md. Jamil-Ur
    2018 INTERNATIONAL CONFERENCE ON BANGLA SPEECH AND LANGUAGE PROCESSING (ICBSLP), 2018,
  • [2] Using Collaborative Filtering Algorithms Combined with Doc2Vec for Movie Recommendation
    Liu, Gaojun
    Wu, Xingyu
    PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 1461 - 1464
  • [3] Unsupervised News Topic Modelling with Doc2Vec and Spherical Clustering
    Budiarto, Arif
    Rahutomo, Reza
    Putra, Hendra Novyantara
    Cenggoro, Tjeng Wawan
    Kacamarga, Muhamad Fitra
    Pardamean, Bens
    5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE 2020, 2021, 179 : 40 - 46
  • [4] Mining Stack Overflow for API class recommendation using DOC2VEC and LDA
    Lee, Wai Keat
    Su, Moon Ting
    IET SOFTWARE, 2021, 15 (05) : 308 - 322
  • [5] Recommendation method for academic journal submission based on doc2vec and XGBoost
    Huang ZhengWei
    Min JinTao
    Yang YanNi
    Huang Jin
    Tian Ye
    Scientometrics, 2022, 127 : 2381 - 2394
  • [6] Recommendation method for academic journal submission based on doc2vec and XGBoost
    Huang Zhengwei
    Min Jintao
    Yang Yanni
    Huang Jin
    Tian Ye
    SCIENTOMETRICS, 2022, 127 (05) : 2381 - 2394
  • [7] Poem Generation using Transformers and Doc2Vec Embeddings
    Santillan, Marvin C.
    Azcarraga, Arnulfo P.
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [8] Web Service Recommendation based on Knowledge Graph Convolutional Network and Doc2Vec
    Geng, Jinkun
    Cao, Buqing
    Ye, Hongfan
    Chen, Junjie
    Peng, Mi
    Liu, Jianxun
    2020 IEEE WORLD CONGRESS ON SERVICES (SERVICES), 2020, : 95 - 100
  • [9] Who is the Ringleader? Modelling Influence in Discourse using Doc2Vec
    Vyas, Priyank
    Smith, Tony
    Feldman, Philip
    Dant, Aaron
    Calude, Andreea
    Patros, Panos
    2021 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING AND SELF-ORGANIZING SYSTEMS COMPANION (ACSOS-C 2021), 2021, : 299 - 300
  • [10] Semantic Detection of Targeted Attacks Using DOC2VEC Embedding
    El-Rahmany, Mariam S.
    Mohamed, Ensaf Hussein
    Haggag, Mohamed H.
    JOURNAL OF COMMUNICATIONS SOFTWARE AND SYSTEMS, 2021, 17 (04) : 334 - 341