Towards an Accurate Prediction of the Question Quality on Stack Overflow using a Deep-Learning-Based NLP Approach

被引:10
|
作者
Toth, Laszlo [1 ]
Nagy, Balazs [1 ]
Jantho, David [1 ]
Vidacs, Laszlo [1 ,2 ]
Gyimothy, Tibor [1 ,2 ]
机构
[1] Univ Szeged, Dept Software Engn, Szeged, Hungary
[2] Univ Szeged, MTA SZTE Res Grp Artificial Intelligence, Szeged, Hungary
关键词
Question Answering; Q&A; Stack Overflow; Quality; Natural Language Processing; NLP; Deep Learning; Doc2Vec;
D O I
10.5220/0007971306310639
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Online question answering (Q&A) forums like Stack Overflow have been playing an increasingly important role in supporting the daily tasks of developers. Stack Overflow can be considered as a meeting point of experienced developers and those who are looking for a solution for a specific problem. Since anyone with any background and experience level can ask and respond to questions, the community tries to use different solutions to maintain quality, such as closing and deleting inappropriate posts. As over 8,000 posts arrive on Stack Overflow every day, the effective automatic filtering of them is essential. In this paper, we present a novel approach for classifying questions based exclusively on their linguistic and semantic features using deep learning method. Our binary classifier relying on the textual properties of posts can predict whether the question is to be closed with an accuracy of 74% similar to the results of previous metrics-based models. In accordance with our findings we conclude that by combining deep learning and natural language processing methods, the maintenance of quality at Q&A forums could be supported using only the raw text of posts.
引用
收藏
页码:631 / 639
页数:9
相关论文
共 50 条
  • [11] Bug severity prediction using question-and-answer pairs from Stack Overflow
    Tan, Youshuai
    Xu, Sijie
    Wang, Zhaowei
    Zhang, Tao
    Xu, Zhou
    Luo, Xiapu
    JOURNAL OF SYSTEMS AND SOFTWARE, 2020, 165
  • [12] State and tendency: an empirical study of deep learning question&answer topics on Stack Overflow
    Henghui Zhao
    Yanhui Li
    Fanwei Liu
    Xiaoyuan Xie
    Lin Chen
    Science China Information Sciences, 2021, 64
  • [13] State and tendency: an empirical study of deep learning question&answer topics on Stack Overflow
    Zhao, Henghui
    Li, Yanhui
    Liu, Fanwei
    Xie, Xiaoyuan
    Chen, Lin
    SCIENCE CHINA-INFORMATION SCIENCES, 2021, 64 (11)
  • [14] Stack Overflow Considered Helpful! Deep Learning Security Nudges Towards Stronger Cryptography
    Fischer, Felix
    Xiao, Huang
    Kao, Ching-Yu
    Stachelscheid, Yannick
    Johnson, Benjamin
    Razar, Danial
    Fawkesley, Paul
    Buckley, Nat
    Boettinger, Konstantin
    Muntean, Paul
    Grossklags, Jens
    PROCEEDINGS OF THE 28TH USENIX SECURITY SYMPOSIUM, 2019, : 339 - 356
  • [15] An Integrated Deep-Learning-Based Approach for Energy Consumption Prediction of Machining Systems
    Zhang, Meihang
    Zhang, Hua
    Yan, Wei
    Jiang, Zhigang
    Zhu, Shuo
    SUSTAINABILITY, 2023, 15 (07)
  • [16] Prediction of Relatedness in Stack Overflow: Deep Learning vs. SVM A Reproducibility Study
    Xu, Bowen
    Shirani, Amirreza
    Lo, David
    Alipour, Mohammad Amin
    PROCEEDINGS OF THE 12TH ACM/IEEE INTERNATIONAL SYMPOSIUM ON EMPIRICAL SOFTWARE ENGINEERING AND MEASUREMENT (ESEM 2018), 2018,
  • [17] Synchronization and Detection in Molecular Communication Using a Deep-Learning-Based Approach
    Casaleiro, Duarte
    Souto, Nuno M. B.
    Silva, Joao C.
    IEEE ACCESS, 2024, 12 : 192539 - 192553
  • [18] Recency and quality-based ranking question in CQAs: A Stack Overflow case study
    Amancio, Leandro
    Dorneles, Carina F.
    Dalip, Daniel H.
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (04)
  • [19] Deep-learning-based vehicle trajectory prediction: A review
    Yin, Chenhui
    Cecotti, Marco
    Auger, Daniel J.
    Fotouhi, Abbas
    Jiang, Haobin
    IET INTELLIGENT TRANSPORT SYSTEMS, 2025, 19 (01)
  • [20] A deep-learning-based approach towards identifying combined faults in structural health monitoring
    Al-Nasser, Heba
    Al-Zuriqat, Thamer
    Dragos, Kosmas
    Chillón Geck, Carlos
    Smarsly, Kay
    e-Journal of Nondestructive Testing, 2024, 29 (07):