Finding Trends in Software Research

Cited by: 13
Authors
Mathew, George [1]
Agrawal, Amritanshu [1]
Menzies, Tim [1]
Affiliation
[1] North Carolina State Univ NCSU, Dept Comp Sci CS, Raleigh, NC 27695 USA
Keywords
Software engineering; Conferences; Software; Analytical models; Data models; Predictive models; Testing; bibliometrics; topic modeling; text mining; RESEARCH TOPICS; INSTITUTIONS; EVOLUTION; RANKING; GENDER;
DOI
10.1109/TSE.2018.2870388
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Text mining methods can find large-scale trends within research communities. For example, using stable Latent Dirichlet Allocation (a topic modeling algorithm), this study found 10 major topics in 35,391 SE research papers published over the last 25 years in 34 leading SE venues (divided evenly between conferences and journals). Our study also shows how those topics have changed over recent years. We also note that, in the historical record, mono-focusing on a single topic can lead to fewer citations than otherwise. Further, while we find no overall gender bias in SE authorship, we note that women are under-represented in the top-most cited papers in our field. Lastly, we show a previously unreported dichotomy between software conferences and journals: research topics that succeed at conferences might not succeed at journals, and vice versa. An important aspect of this work is that it is automatic and quickly repeatable, unlike prior SE bibliometric studies that used tediously slow and labor-intensive methods. Automation is important since, as with any data mining study, the conclusions are skewed by the data used in the analysis. The automatic methods of this paper make it far easier for other researchers to re-apply the analysis to new data or to use different modeling assumptions.
Pages: 1397-1410
Number of pages: 14