Hot question prediction in Stack Overflow

被引:2
|
作者
Zhao, Li Xian [1 ]
Zhang, Li [1 ]
Jiang, Jing [1 ]
机构
[1] Beihang Univ, State Key Lab Software Dev Environm, 37 Xueyuan Rd, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
This work was supported in part by the National Key Research and Development Program of China under Grant 2018YFB1004202, in part by the National Natural Science Foundation of China under Grant 61,732,019, in part by the State Key Laboratory of Software Development Environment under Grant SKLSDE‐2019ZX‐05, and in part by Fundamental Research Funds for the Central Universities under Grant No. YWF‐20‐BJ‐J‐1018;
D O I
10.1049/sfw2.12013
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Stack Overflow is a very popular programming question and answer community. Some questions become hot, and receive high views, which are of widespread concern to developers. Finding hot questions early can give priority to recommend potential hot questions to answers, thereby shortening the response time. Besides, the hot question prediction is also helpful for making advertising plan, planning advertising campaigns and estimating costs. Therefore, it is important to predict hot questions. The authors propose the VSAF method which analyses the View amount changes, Answer amount changes and Score changes soon after questions' creation based on Fully convolutional neural network. The performance of the VSAF method based on a training set and two different test sets has been evaluated. The training set has 1600 hot questions and 1600 cold questions. The random test set has 381 hot questions and 2819 cold questions, while the balanced test set has 400 hot questions and 400 cold questions. The experimental results show that using the balanced test set, VSAF achieves Accuracy, F1(hot) and F1(cold) of 80%, 77.77% and 81.81%, which outperforms the baseline approach by 25.59%, 21.52% and 29.04%, respectively. Using the random test set for evaluation, VSAF achieves Accuracy, F1(hot) and F1(cold) of 84.91%, 53.96% and 90.97%, which outperforms the baseline approach by 31.83%, 84.16% and 19.35%, respectively. The VSAF method significantly outperforms the state-of-the-art approach on hot question prediction.
引用
收藏
页码:90 / 106
页数:17
相关论文
共 50 条
  • [21] Improving Response Time Prediction for Stack Overflow Questions
    Wu, Di
    Johnson, Simon
    Foster, Chris
    Li, Erwin
    Elmiligi, Haytham
    Rahman, Musfiq
    2019 IEEE 10TH ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS AND MOBILE COMMUNICATION CONFERENCE (IEMCON), 2019, : 786 - 791
  • [22] A survey on mining stack overflow: question and answering (Q&A) community
    Ahmad, Arshad
    Feng, Chong
    Ge, Shi
    Yousif, Abdallah
    DATA TECHNOLOGIES AND APPLICATIONS, 2018, 52 (02) : 190 - 247
  • [23] Generating Question Titles for Stack Overflow from Mined Code Snippets
    Gao, Zhipeng
    Xia, Xin
    Grundy, John
    Lo, David
    Li, Yuan-Fang
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2020, 29 (04)
  • [24] Attention-based model for predicting question relatedness on Stack Overflow
    Pei, Jiayan
    Wu, Yimin
    Qin, Zishan
    Cong, Yao
    Guan, Jingtao
    2021 IEEE/ACM 18TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2021), 2021, : 97 - 107
  • [25] Dataset of network simulator related-question posts in stack overflow
    Nugroho, Yusuf Sulistyo
    Islam, Syful
    Gunawan, Dedi
    Kurniawan, Yogiek Indra
    Hossain, Md Javed
    DATA IN BRIEF, 2022, 41
  • [26] Why Is Stack Overflow Failing? Preserving Sustainability in Community Question Answering
    Srba, Ivan
    Bielikova, Maria
    IEEE SOFTWARE, 2016, 33 (04) : 80 - 89
  • [27] Characterization and Prediction of Questions without Accepted Answers on Stack Overflow
    Yazdaninia, Mohamad
    Lo, David
    Sami, Ashkan
    2021 IEEE/ACM 29TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC 2021), 2021, : 59 - 70
  • [28] Employing Source Code Information to Improve Question-Answering in Stack Overflow
    Diamantopoulos, Themistoklis
    Symeonidis, Andreas L.
    12TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2015), 2015, : 454 - 457
  • [29] Automated Question Title Reformulation by Mining Modification Logs From Stack Overflow
    Liu, Ke
    Chen, Xiang
    Chen, Chunyang
    Xie, Xiaofei
    Cui, Zhanqi
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (09) : 4390 - 4410
  • [30] QTC4SO: Automatic Question Title Completion for Stack Overflow
    Zhou, Yanlin
    Yang, Shaoyu
    Chen, Xiang
    Zhang, Zichen
    Pei, Jiahua
    2023 IEEE/ACM 31ST INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, ICPC, 2023, : 1 - 12