PEPT: Expert Finding Meets Personalized Pre-Training

Cited by: 0
Authors
Peng, Qiyao [1 ]
Xu, Hongyan [2 ]
Wang, Yinghui [3 ]
Liu, Hongtao [4 ]
Huo, Cuiying [5 ]
Wang, Wenjun [5 ,6 ]
Affiliations
[1] Tianjin Univ, Sch New Media & Commun, Tianjin, Peoples R China
[2] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[3] Beijing Inst Control & Elect Technol, Key Lab Informat Syst & Technol, Beijing, Peoples R China
[4] Du Xiaoman Technol, Beijing, Peoples R China
[5] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[6] Hainan Trop Ocean Univ, YazhouBay Innovat Inst, Hainan, Peoples R China
Keywords
Contrastive Learning;
DOI
10.1145/3690380
Chinese Library Classification (CLC)
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Expert finding is essential in Community Question Answering (CQA) platforms, as it enables effective routing of questions to users who can provide relevant answers. The key is to learn personalized expert representations from each expert's historically answered questions and to accurately match them with target questions. Recently, Pre-Trained Language Models (PLMs) have attracted significant attention owing to their impressive ability to comprehend textual data, and they are widely used across various domains. Some preliminary works have explored the applicability of PLMs to expert finding, such as pre-training expert or question representations. However, these models usually learn pure text representations of experts from their histories, neglecting personalized and fine-grained expert modeling. To alleviate this, we present a personalized pre-training and fine-tuning paradigm that learns expert interest and expertise simultaneously. Specifically, in our pre-training framework, we integrate the historical answered questions of an expert with a target question and treat the result as a candidate-aware, expert-level input unit. We then fuse expert IDs into pre-training to guide the model toward personalized expert representations, which helps capture the unique characteristics and expertise of each individual expert. In addition, we design two pre-training tasks: (1) a question-level masked language model task that learns the relatedness among historical questions, enabling the modeling of question-level expert interest; and (2) a vote-oriented task that captures question-level expert expertise by predicting the vote score the expert would receive. Through this pre-training framework and these tasks, our approach holistically learns expert representations covering both interests and expertise. Our method has been extensively evaluated on six real-world CQA datasets, and the experimental results consistently demonstrate its superiority over competitive baselines.
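The ingredients described in the abstract can be roughly illustrated in code. The following is a minimal, hypothetical PyTorch/Hugging Face sketch, not the authors' implementation: it shows a candidate-aware input built from a target question plus an expert's answer history, an expert-ID embedding fused with the encoder output, a token-level head standing in for the question-level masked language model task, and a vote-prediction head. Names such as ExpertPretrainModel, expert_embed, mlm_head, and vote_head are assumptions for illustration; the actual PEPT architecture and losses may differ.

```python
# Minimal sketch of the pre-training setup sketched in the abstract.
# Assumes a BERT-like encoder; all class/variable names are illustrative, not the paper's.
import torch
import torch.nn as nn
from transformers import BertModel, BertTokenizerFast


class ExpertPretrainModel(nn.Module):
    def __init__(self, encoder_name="bert-base-uncased", num_experts=100):
        super().__init__()
        self.encoder = BertModel.from_pretrained(encoder_name)
        hidden = self.encoder.config.hidden_size
        # Expert-ID embedding fused with the text representation (personalization).
        self.expert_embed = nn.Embedding(num_experts, hidden)
        # Head standing in for the question-level masked language model task.
        self.mlm_head = nn.Linear(hidden, self.encoder.config.vocab_size)
        # Head for the vote-oriented task (predicts the vote score of an answer).
        self.vote_head = nn.Linear(hidden, 1)

    def forward(self, input_ids, attention_mask, expert_ids):
        out = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
        token_states = out.last_hidden_state          # (batch, seq_len, hidden)
        pooled = token_states[:, 0]                   # [CLS] summary of the input unit
        personalized = pooled + self.expert_embed(expert_ids)
        return self.mlm_head(token_states), self.vote_head(personalized).squeeze(-1)


# Candidate-aware expert-level input: target question concatenated with the expert's
# answered-question history, separated by [SEP] (an assumed input layout).
tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
target_q = "How do I profile memory usage in Python?"
history = ["What does the gc module do?", "How to read tracemalloc snapshots?"]
text = f" {tokenizer.sep_token} ".join([target_q] + history)
enc = tokenizer(text, return_tensors="pt", truncation=True, max_length=256)

model = ExpertPretrainModel(num_experts=100)
mlm_logits, vote_pred = model(enc["input_ids"], enc["attention_mask"],
                              expert_ids=torch.tensor([7]))
# Vote-oriented objective (toy target); the question-level MLM loss would additionally
# be computed over masked history tokens and is omitted here for brevity.
vote_loss = nn.functional.mse_loss(vote_pred, torch.tensor([3.0]))
```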
Pages: 26