Hot question prediction in Stack Overflow

被引：2

作者：

Zhao, Li Xian ^{[1
]}

Zhang, Li ^{[1
]}

Jiang, Jing ^{[1
]}

机构：

[1] Beihang Univ, State Key Lab Software Dev Environm, 37 Xueyuan Rd, Beijing, Peoples R China

来源：

IET SOFTWARE | 2021年 / 15卷 / 01期

基金：

中国国家自然科学基金;

关键词：

This work was supported in part by the National Key Research and Development Program of China under Grant 2018YFB1004202, in part by the National Natural Science Foundation of China under Grant 61,732,019, in part by the State Key Laboratory of Software Development Environment under Grant SKLSDE‐2019ZX‐05, and in part by Fundamental Research Funds for the Central Universities under Grant No. YWF‐20‐BJ‐J‐1018;

D O I：

10.1049/sfw2.12013

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Stack Overflow is a very popular programming question and answer community. Some questions become hot, and receive high views, which are of widespread concern to developers. Finding hot questions early can give priority to recommend potential hot questions to answers, thereby shortening the response time. Besides, the hot question prediction is also helpful for making advertising plan, planning advertising campaigns and estimating costs. Therefore, it is important to predict hot questions. The authors propose the VSAF method which analyses the View amount changes, Answer amount changes and Score changes soon after questions' creation based on Fully convolutional neural network. The performance of the VSAF method based on a training set and two different test sets has been evaluated. The training set has 1600 hot questions and 1600 cold questions. The random test set has 381 hot questions and 2819 cold questions, while the balanced test set has 400 hot questions and 400 cold questions. The experimental results show that using the balanced test set, VSAF achieves Accuracy, F1(hot) and F1(cold) of 80%, 77.77% and 81.81%, which outperforms the baseline approach by 25.59%, 21.52% and 29.04%, respectively. Using the random test set for evaluation, VSAF achieves Accuracy, F1(hot) and F1(cold) of 84.91%, 53.96% and 90.97%, which outperforms the baseline approach by 31.83%, 84.16% and 19.35%, respectively. The VSAF method significantly outperforms the state-of-the-art approach on hot question prediction.

引用

页码：90 / 106

页数：17

共 50 条

[21] Improving Response Time Prediction for Stack Overflow Questions
Wu, Di
Johnson, Simon
Foster, Chris
Li, Erwin
Elmiligi, Haytham
Rahman, Musfiq
2019 IEEE 10TH ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS AND MOBILE COMMUNICATION CONFERENCE (IEMCON), 2019, : 786 - 791
[22] A survey on mining stack overflow: question and answering (Q&A) community
Ahmad, Arshad
Feng, Chong
Ge, Shi
Yousif, Abdallah
DATA TECHNOLOGIES AND APPLICATIONS, 2018, 52 (02) : 190 - 247
[23] Generating Question Titles for Stack Overflow from Mined Code Snippets
Gao, Zhipeng
Xia, Xin
Grundy, John
Lo, David
Li, Yuan-Fang
ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2020, 29 (04)
[24] Attention-based model for predicting question relatedness on Stack Overflow
Pei, Jiayan
Wu, Yimin
Qin, Zishan
Cong, Yao
Guan, Jingtao
2021 IEEE/ACM 18TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2021), 2021, : 97 - 107
[25] Dataset of network simulator related-question posts in stack overflow
Nugroho, Yusuf Sulistyo
Islam, Syful
Gunawan, Dedi
Kurniawan, Yogiek Indra
Hossain, Md Javed
DATA IN BRIEF, 2022, 41
[26] Why Is Stack Overflow Failing? Preserving Sustainability in Community Question Answering
Srba, Ivan
Bielikova, Maria
IEEE SOFTWARE, 2016, 33 (04) : 80 - 89
[27] Characterization and Prediction of Questions without Accepted Answers on Stack Overflow
Yazdaninia, Mohamad
Lo, David
Sami, Ashkan
2021 IEEE/ACM 29TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC 2021), 2021, : 59 - 70
[28] Employing Source Code Information to Improve Question-Answering in Stack Overflow
Diamantopoulos, Themistoklis
Symeonidis, Andreas L.
12TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2015), 2015, : 454 - 457
[29] Automated Question Title Reformulation by Mining Modification Logs From Stack Overflow
Liu, Ke
Chen, Xiang
Chen, Chunyang
Xie, Xiaofei
Cui, Zhanqi
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (09) : 4390 - 4410
[30] QTC4SO: Automatic Question Title Completion for Stack Overflow
Zhou, Yanlin
Yang, Shaoyu
Chen, Xiang
Zhang, Zichen
Pei, Jiahua
2023 IEEE/ACM 31ST INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, ICPC, 2023, : 1 - 12

← 1 2 3 4 5 →