Evaluating Unsupervised Text Embeddings on Software User Feedback

被引:11
|
作者
Devine, Peter [1 ]
Koh, Yun Sing [1 ]
Blincoe, Kelly [1 ]
机构
[1] Univ Auckland, Auckland, New Zealand
关键词
REVIEWS; MODELS;
D O I
10.1109/REW53955.2021.00020
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
User feedback on software products has been shown to be useful for development and can be exceedingly abundant online. Many approaches have been developed to elicit requirements in different ways from this large volume of feedback, including the use of unsupervised clustering, underpinned by text embeddings. Methods for embedding text can vary significantly within the literature, highlighting the lack of a consensus as to which approaches are best able to cluster user feedback into requirements relevant groups. This work proposes a methodology for comparing text embeddings of user feedback using existing labelled datasets. Using 7 diverse datasets from the literature, we apply this methodology to evaluate both established text embedding techniques from the user feedback analysis literature (including topic modelling and word embeddings) as well as text embeddings from state of the art deep text embedding models. Results demonstrate that text embeddings produced by state of the art models, most notably the Universal Sentence Encoder (USE), group feedback with similar requirements relevant characteristics together better than other evaluated techniques across all seven datasets. These results can help researchers select appropriate embedding techniques when developing future unsupervised clustering approaches within user feedback analysis.
引用
收藏
页码:87 / 95
页数:9
相关论文
共 50 条
  • [21] On the Reliability of User Feedback for Evaluating the Quality of Conversational Agents
    Massiah, Jordan
    Yilmaz, Emine
    Jiao, Yunlong
    Kazai, Gabriella
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 4185 - 4189
  • [22] Designing a Study for Evaluating User Feedback on Predesign Models
    Voehringer, Juergen
    Bellstroem, Peter
    Gaelle, Doris
    Kop, Christian
    INFORMATION SYSTEMS DEVELOPMENT: ASIAN EXPERIENCES, 2011, : 411 - 423
  • [23] Evaluating a software to help students to understand scientific text
    Crinon, Jacques
    Legros, Denis
    Marin, Brigitte
    Avel, Patrick
    ALSIC-APPRENTISSAGE DES LANGUES ET SYSTEMS D INFORMATION ET DE COMMUNICATION, 2007, 10 (01): : 51 - 64
  • [24] Unsupervised Embeddings for Categorical Variables
    De Meulemeester, Hannes
    De Moor, Bart
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [25] Unsupervised Multilingual Word Embeddings
    Chen, Xilun
    Cardie, Claire
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 261 - 270
  • [26] Evaluating pre-trained models for user feedback analysis in software engineering: a study on classification of app-reviews
    Hadi, Mohammad A.
    Fard, Fatemeh H. H.
    EMPIRICAL SOFTWARE ENGINEERING, 2023, 28 (04)
  • [27] Evaluating pre-trained models for user feedback analysis in software engineering: a study on classification of app-reviews
    Mohammad A Hadi
    Fatemeh H. Fard
    Empirical Software Engineering, 2023, 28
  • [28] Webification of software development: User feedback for developer’s modeling
    Kuric, Eduard
    Bieliková, Maria
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8541 : 550 - 553
  • [29] Context-Aware User Feedback in Continuous Software Evolution
    Dzvonyar, Dora
    Krusche, Stephan
    Alkadhi, Rana
    Bruegge, Bernd
    INTERNATIONAL WORKSHOP ON CONTINUOUS SOFTWARE EVOLUTION AND DELIVERY, CSED 2016, 2016, : 12 - 18
  • [30] Webification of Software Development: User Feedback for Developer's Modeling
    Kuric, Eduard
    Bielikova, Maria
    WEB ENGINEERING, ICWE 2014, 2014, 8541 : 550 - 553