A feature location approach for mapping application features extracted from crowd-based screencasts to source code

被引：0

作者：

Parisa Moslehi

Bram Adams

Juergen Rilling

机构：

[1] Concordia Universitys,

[2] Queen’s University,undefined

来源：

Empirical Software Engineering | 2020年 / 25卷

关键词：

Crowd-based documentation; Mining video content; Speech analysis; Feature location; Software traceability; Information extraction; Software documentation;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Crowd-based multimedia documents such as screencasts have emerged as a source for documenting requirements, the workflow and implementation issues of open source and agile software projects. For example, users can show and narrate how they manipulate an application’s GUI to perform a certain functionality, or a bug reporter could visually explain how to trigger a bug or a security vulnerability. Unfortunately, the streaming nature of programming screencasts and their binary format limit how developers can interact with a screencast’s content. In this research, we present an automated approach for mining and linking the multimedia content found in screencasts to their relevant software artifacts and, more specifically, to source code. We apply LDA-based mining approaches that take as input a set of screencast artifacts, such as GUI text and spoken word, to make the screencast content accessible and searchable to users and to link it to their relevant source code artifacts. To evaluate the applicability of our approach, we report on results from case studies that we conducted on existing WordPress and Mozilla Firefox screencasts. We found that our automated approach can significantly speed up the feature location process. For WordPress, we find that our approach using screencast speech and GUI text can successfully link relevant source code files within the top 10 hits of the result set with median Reciprocal Rank (RR) of 50% (rank 2) and 100% (rank 1). In the case of Firefox, our approach can identify relevant source code directories within the top 100 hits using screencast speech and GUI text with the median RR = 20%, meaning that the first true positive is ranked 5 or higher in more than 50% of the cases. Also, source code related to the frontend implementation that handles high-level or GUI-related aspects of an application is located with higher accuracy. We also found that term frequency rebalancing can further improve the linking results when using less noisy scenarios or locating less technical implementation of scenarios. Investigating the results of using original and weighted screencast data sources (speech, GUI, speech and GUI) that can result in having the highest median RR values in both case studies shows that speech data is an important information source that can result in having RR of 100%.

引用

页码：4873 / 4926

页数：53

共 45 条

[41] Building an open source classifier for the neonatal EEG background: a systematic feature-based approach from expert scoring to clinical visualization (vol 15, 675154, 2021)
Montazeri, Saeed
Pinchefsky, Elana
Tse, Ilse
Marchi, Viviana
Kohonen, Jukka
Kauppila, Minna
Airaksinen, Manu
Tapani, Karoliina
Nevalainen, Paeivi
Hahn, Cecil
Tam, Emily W. Y.
Stevenson, Nathan J.
Vanhatalo, Sampsa
FRONTIERS IN HUMAN NEUROSCIENCE, 2024, 18
[42] Mapping reservoir water quality from Sentinel-2 satellite data based on a new approach of weighted averaging: Application of Bayesian maximum entropy
Nikoo, Mohammad Reza
Zamani, Mohammad G.
Zadeh, Mahshid Mohammad
Al-Rawas, Ghazi
Al-Wardy, Malik
Gandomi, Amir H.
SCIENTIFIC REPORTS, 2024, 14 (01):
[43] Feature Extraction From Impedance Spectrum of Au/ZrO2 Nanotube/Zr-Based MIM Sensor for Selective Discrimination of VOCs: A Potential Approach for E-Nose Application
Thakur, Uttam Narendra
Hazra, Arnab
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 1
[44] An Integrated Approach for Mapping Three-Dimensional CoSeismic Displacement Fields from Sentinel-1 TOPS Data Based on DInSAR, POT, MAI and BOI Techniques: Application to the 2021 Mw 7.4 Maduo Earthquake
Xu, Lang
Chen, Qiang
Zhao, Jing-Jing
Liu, Xian-Wen
Xu, Qian
Yang, Ying-Hui
REMOTE SENSING, 2021, 13 (23)
[45] Development of Simulation-Based Approach Using Frame Models Generated From GIS Features and BIM Data for Application to City Seismic Response Analysis of Low- to Mid-Rise RC Structures in Metro Manila
Quinay, Pher Errol B.
Soliman, John Michael M.
Fader, Aileen Rachelle F.
JOURNAL OF EARTHQUAKE AND TSUNAMI, 2020, 14 (06)

← 1 2 3 4 5 →