Hot question prediction in Stack Overflow

被引：2

作者：

Zhao, Li Xian ^{[1
]}

Zhang, Li ^{[1
]}

Jiang, Jing ^{[1
]}

机构：

[1] Beihang Univ, State Key Lab Software Dev Environm, 37 Xueyuan Rd, Beijing, Peoples R China

来源：

IET SOFTWARE | 2021年 / 15卷 / 01期

基金：

中国国家自然科学基金;

关键词：

This work was supported in part by the National Key Research and Development Program of China under Grant 2018YFB1004202, in part by the National Natural Science Foundation of China under Grant 61,732,019, in part by the State Key Laboratory of Software Development Environment under Grant SKLSDE‐2019ZX‐05, and in part by Fundamental Research Funds for the Central Universities under Grant No. YWF‐20‐BJ‐J‐1018;

D O I：

10.1049/sfw2.12013

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Stack Overflow is a very popular programming question and answer community. Some questions become hot, and receive high views, which are of widespread concern to developers. Finding hot questions early can give priority to recommend potential hot questions to answers, thereby shortening the response time. Besides, the hot question prediction is also helpful for making advertising plan, planning advertising campaigns and estimating costs. Therefore, it is important to predict hot questions. The authors propose the VSAF method which analyses the View amount changes, Answer amount changes and Score changes soon after questions' creation based on Fully convolutional neural network. The performance of the VSAF method based on a training set and two different test sets has been evaluated. The training set has 1600 hot questions and 1600 cold questions. The random test set has 381 hot questions and 2819 cold questions, while the balanced test set has 400 hot questions and 400 cold questions. The experimental results show that using the balanced test set, VSAF achieves Accuracy, F1(hot) and F1(cold) of 80%, 77.77% and 81.81%, which outperforms the baseline approach by 25.59%, 21.52% and 29.04%, respectively. Using the random test set for evaluation, VSAF achieves Accuracy, F1(hot) and F1(cold) of 84.91%, 53.96% and 90.97%, which outperforms the baseline approach by 31.83%, 84.16% and 19.35%, respectively. The VSAF method significantly outperforms the state-of-the-art approach on hot question prediction.

引用

页码：90 / 106

页数：17

共 50 条

[1] Quality Prediction of a Stack Overflow Question Using Machine Learning
Mehta, Tanvi
Multaikar, Samruddhi
Patil, Srushti
Gawande, Namrata
ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 2, AITA 2023, 2024, 844 : 65 - 80
[2] Cross-Community Question Relevance Prediction for Stack Overflow and GitHub
Yu, Song
Jiang, Bugao
Zhang, Danni
Liao, Zhifang
JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2025, 31 (01) : 52 - 71
[3] Question Similarity Detection on Stack Overflow Sites
Botto-Tobar, Miguel
2022 XVLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI 2022), 2022,
[4] Who will Answer my Question on Stack Overflow?
Choetkiertikul, Morakot
Avery, Daniel
Dam, Hoa Khanh
Tran, Truyen
Ghose, Aditya
2015 24TH AUSTRALASIAN SOFTWARE ENGINEERING CONFERENCE (ASWEC 2015), 2015, : 155 - 164
[5] An empirical study of question discussions on Stack Overflow
Wenhan Zhu
Haoxiang Zhang
Ahmed E. Hassan
Michael W. Godfrey
Empirical Software Engineering, 2022, 27
[6] An empirical study of question discussions on Stack Overflow
Zhu, Wenhan
Zhang, Haoxiang
Hassan, Ahmed E.
Godfrey, Michael W.
EMPIRICAL SOFTWARE ENGINEERING, 2022, 27 (06)
[7] Bug severity prediction using question-and-answer pairs from Stack Overflow
Tan, Youshuai
Xu, Sijie
Wang, Zhaowei
Zhang, Tao
Xu, Zhou
Luo, Xiapu
JOURNAL OF SYSTEMS AND SOFTWARE, 2020, 165
[8] It Takes Two to Tango: Deleted Stack Overflow Question Prediction with Text and Meta Features
Xia, Xin
Lo, David
Correa, Denzil
Sureka, Ashish
Shihab, Emad
PROCEEDINGS 2016 IEEE 40TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS, VOL 1, 2016, : 73 - 82
[9] Duplicate Question Detection in Stack Overflow: A Reproducibility Study
Silva, Rodrigo F. G.
Paixao, Klerisson
Maia, Marcelo de Almeida
2018 25TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING (SANER 2018), 2018, : 572 - 581
[10] Automatically Classifying Posts into Question Categories on Stack Overflow
Beyer, Stefanie
Macho, Christian
Pinzger, Martin
Di Penta, Massimiliano
2018 IEEE/ACM 26TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC 2018), 2018, : 211 - 221

← 1 2 3 4 5 →