Learning heterogeneous subgraph representations for team discovery

被引:2
|
作者
Hamidi Rad, Radin [1 ]
Nguyen, Hoang [1 ]
Al-Obeidat, Feras [2 ]
Bagheri, Ebrahim [1 ]
Kargar, Mehdi [1 ]
Srivastava, Divesh [3 ]
Szlichta, Jaroslaw [4 ]
Zarrinkalam, Fattane [5 ]
机构
[1] Toronto Metropolitan Univ, Toronto, ON, Canada
[2] Zayed Univ, Dubai, U Arab Emirates
[3] AT & T Chief Data Off, Bedminster, NJ USA
[4] York Univ, Toronto, ON, Canada
[5] Univ Guelph, Guelph, ON, Canada
来源
INFORMATION RETRIEVAL JOURNAL | 2023年 / 26卷 / 1-2期
基金
加拿大自然科学与工程研究理事会;
关键词
Expert search; Heterogeneous graph embeddings; Task assignment; Team discovery; SELECTION; SEARCH;
D O I
10.1007/s10791-023-09421-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The team discovery task is concerned with finding a group of experts from a collaboration network who would collectively cover a desirable set of skills. Most prior work for team discovery either adopt graph-based or neural mapping approaches. Graph-based approaches are computationally intractable often leading to sub-optimal team selection. Neural mapping approaches have better performance, however, are still limited as they learn individual representations for skills and experts and are often prone to overfitting given the sparsity of collaboration networks. Thus, we define the team discovery task as one of learning subgraph representations from a heterogeneous collaboration network where the subgraphs represent teams which are then used to identify relevant teams for a given set of skills. As such, our approach captures local (node interactions with each team) and global (subgraph interactions between teams) characteristics of the representation network and allows us to easily map between any homogeneous and heterogeneous subgraphs in the network to effectively discover teams. Our experiments over two real-world datasets from different domains, namely DBLP bibliographic dataset with 10,647 papers and IMDB with 4882 movies, illustrate that our approach outperforms the state-of-the-art baselines on a range of ranking and quality metrics. More specifically, in terms of ranking metrics, we are superior to the best baseline by approximately 15% on the DBLP dataset and by approximately 20% on the IMDB dataset. Further, our findings illustrate that our approach consistently shows a robust performance improvement over the baselines.
引用
收藏
页数:40
相关论文
共 50 条
  • [1] Learning heterogeneous subgraph representations for team discovery
    Radin Hamidi Rad
    Hoang Nguyen
    Feras Al-Obeidat
    Ebrahim Bagheri
    Mehdi Kargar
    Divesh Srivastava
    Jaroslaw Szlichta
    Fattane Zarrinkalam
    Information Retrieval Journal, 2023, 26
  • [2] Subgraph Representation Learning for Team Mining
    Rad, Radin Hamidi
    Bagheri, Ebrahim
    Kargar, Mehdi
    Srivastava, Divesh
    Szlichta, Jaroslaw
    PROCEEDINGS OF THE 14TH ACM WEB SCIENCE CONFERENCE, WEBSCI 2022, 2022, : 148 - 153
  • [3] Learning Rule-Induced Subgraph Representations for Inductive Relation Prediction
    Liu, Tianyu
    Lv, Qitan
    Wang, Jie
    Yang, Shuling
    Chen, Hanzhu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [4] Frequent subgraph discovery
    Kuramochi, M
    Karypis, G
    2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 313 - 320
  • [5] HMSG: Heterogeneous graph neural network based on Metapath SubGraph learning
    Guan, Mengya
    Cai, Xinjun
    Shang, Jiaxing
    Hao, Fei
    Liu, Dajiang
    Jiao, Xianlong
    Ni, Wancheng
    KNOWLEDGE-BASED SYSTEMS, 2023, 279
  • [6] Neuroevolutionary representations for learning heterogeneous treatment effects
    Burkhart, Michael C.
    Ruiz, Gabriel
    JOURNAL OF COMPUTATIONAL SCIENCE, 2023, 71
  • [7] DyHNet: Learning dynamic heterogeneous network representations
    Nguyen, Hoang
    Rad, Radin Hamidi
    Zarrinkalam, Fattane
    Bagheri, Ebrahim
    INFORMATION SCIENCES, 2023, 646
  • [8] An Efficient System for Subgraph Discovery
    Joshi, Aparna
    Zhang, Yu
    Bogdanov, Petko
    Hwang, Jeong-Hyon
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 703 - 712
  • [9] Robust Densest Subgraph Discovery
    Miyauchi, Atsushi
    Takeda, Akiko
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1188 - 1193
  • [10] On Directed Densest Subgraph Discovery
    Ma, Chenhao
    Fang, Yixiang
    Cheng, Reynold
    Lakshmanan, Laks V. S.
    Zhang, Wenjie
    Lin, Xuemin
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2021, 46 (04):