Hot question prediction in Stack Overflow

Cited by: 2
Authors
Zhao, Li Xian [1]
Zhang, Li [1]
Jiang, Jing [1]
Affiliations
[1] Beihang University, State Key Laboratory of Software Development Environment, 37 Xueyuan Road, Beijing, People's Republic of China
Funding
National Natural Science Foundation of China
This work was supported in part by the National Key Research and Development Program of China under Grant 2018YFB1004202, in part by the National Natural Science Foundation of China under Grant 61732019, in part by the State Key Laboratory of Software Development Environment under Grant SKLSDE-2019ZX-05, and in part by the Fundamental Research Funds for the Central Universities under Grant No. YWF-20-BJ-J-1018.
DOI
10.1049/sfw2.12013
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Stack Overflow is a very popular programming question-and-answer community. Some questions become hot: they receive a high number of views and are of widespread concern to developers. Identifying hot questions early makes it possible to recommend potential hot questions to answerers with priority, thereby shortening the response time. Hot question prediction also helps with planning advertising campaigns and estimating costs. Therefore, it is important to predict hot questions. The authors propose the VSAF method, which analyses the View amount changes, Answer amount changes and Score changes soon after a question's creation, based on a Fully convolutional neural network. The performance of the VSAF method was evaluated using a training set and two different test sets. The training set has 1600 hot questions and 1600 cold questions. The random test set has 381 hot questions and 2819 cold questions, while the balanced test set has 400 hot questions and 400 cold questions. The experimental results show that, on the balanced test set, VSAF achieves Accuracy, F1(hot) and F1(cold) of 80%, 77.77% and 81.81%, outperforming the baseline approach by 25.59%, 21.52% and 29.04%, respectively. On the random test set, VSAF achieves Accuracy, F1(hot) and F1(cold) of 84.91%, 53.96% and 90.97%, outperforming the baseline approach by 31.83%, 84.16% and 19.35%, respectively. The VSAF method significantly outperforms the state-of-the-art approach on hot question prediction.
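The abstract reports Accuracy, F1(hot) and F1(cold) without defining them. The sketch below computes the three metrics under the standard definitions, treating each class in turn as the positive class; that this matches the paper's exact computation is an assumption. The function names and the toy labels are hypothetical and are not data from the paper.

```python
from typing import Sequence


def accuracy(y_true: Sequence[str], y_pred: Sequence[str]) -> float:
    """Fraction of questions whose predicted label equals the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_for_class(y_true: Sequence[str], y_pred: Sequence[str], positive: str) -> float:
    """F1 score with `positive` ("hot" or "cold") treated as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    if tp == 0:
        return 0.0  # no correct positive prediction: F1 is taken as 0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy labels for illustration only (NOT results from the paper).
y_true = ["hot", "hot", "hot", "cold", "cold", "cold", "cold", "cold"]
y_pred = ["hot", "hot", "cold", "hot", "cold", "cold", "cold", "cold"]

acc = accuracy(y_true, y_pred)
f1_hot = f1_for_class(y_true, y_pred, "hot")
f1_cold = f1_for_class(y_true, y_pred, "cold")
print(f"Accuracy={acc:.4f} F1(hot)={f1_hot:.4f} F1(cold)={f1_cold:.4f}")
```

Reporting F1 separately per class is useful when the classes are imbalanced, as in the random test set (381 hot versus 2819 cold questions), because accuracy alone can be dominated by the majority class.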
Pages: 90-106
Number of pages: 17