Gender Representation Among Contributors to Open-Source Infrastructure

Cited by: 4
Authors
Qiu, Huilian Sophie [1, 3]
Zhao, Zihe H. [2 ]
Yu, Tielin Katy [1 ]
Wang, Justin [1 ]
Ma, Alexander [1 ]
Fang, Hongbo [1 ]
Dabbish, Laura [1 ]
Vasilescu, Bogdan [1 ]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Rice Univ, Houston, TX USA
[3] Northwestern Univ, Evanston, IL USA
Keywords
open-source software; gender diversity; software development
DOI
10.1109/ICSE-SEIS58686.2023.00025
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
While the severe underrepresentation of women and non-binary people in open source is widely recognized, there is little empirical data on how the situation has changed over time and which subcommunities have been more effective at reducing the gender imbalance. To obtain a clearer picture of gender representation in open source, we compiled and synthesized existing empirical data from the literature, and computed historical trends in the representation of women across 20 open-source ecosystems. While inherently limited by the ability of automatic name-based gender inference to capture true gender identities at an individual level, our census still provides valuable population-level insights. Overall and in most individual ecosystems, we observed a promising upward trend in the percentage of women among code contributors over time, but also high variation in the percentage of women contributors across ecosystems. We also found that, in most ecosystems, women withdraw earlier from open-source participation than men.

General Abstract: The representation of women and non-binary people has been extremely low in the open-source software community. Most of the statistics reported by prior studies are below 10%. However, the majority of prior work was based on subsamples instead of the entire population. Our work started with a review of the gender distributions reported in the literature. We then provided an overview of the gender distribution in 20 of the largest open-source ecosystems, defined by package managers such as npm and PyPI, and investigated how it changed over time. Moreover, we compared the turnover rates of men and women contributors. Overall and in most individual ecosystems, we observed a promising upward trend in the percentage of women among code contributors over time, but also high variation in the percentage of women contributors across ecosystems. We also found that, in most ecosystems, women withdraw earlier from open-source participation than men.
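As an illustration of the kind of population-level measure the abstract describes, the following is a minimal sketch of computing the yearly share of women among code contributors per ecosystem. It is not the authors' pipeline: the record layout, the `share_of_women` helper, the gender labels, and the toy data are all assumptions made for this example, and contributors whose inferred gender is unknown are simply excluded from the denominator.

```python
from collections import defaultdict

# Hypothetical toy records: (ecosystem, year, contributor_id, inferred_gender).
# The labels stand in for the output of some name-based inference step.
records = [
    ("npm", 2015, "a", "man"),
    ("npm", 2015, "b", "woman"),
    ("npm", 2015, "c", "man"),
    ("npm", 2015, "d", "man"),
    ("npm", 2016, "a", "man"),
    ("npm", 2016, "b", "woman"),
    ("npm", 2016, "e", "woman"),
    ("npm", 2016, "f", "unknown"),  # excluded: gender could not be inferred
    ("PyPI", 2016, "g", "man"),
    ("PyPI", 2016, "h", "woman"),
]


def share_of_women(rows):
    """Return {(ecosystem, year): fraction of women among labeled contributors}."""
    # Map each (ecosystem, year) to its distinct contributors and their labels,
    # so a contributor with many commits in one year is counted once.
    seen = defaultdict(dict)
    for ecosystem, year, contributor, gender in rows:
        if gender in ("woman", "man"):
            seen[(ecosystem, year)][contributor] = gender
    return {
        key: sum(g == "woman" for g in people.values()) / len(people)
        for key, people in seen.items()
    }


shares = share_of_women(records)
for (ecosystem, year), share in sorted(shares.items()):
    print(f"{ecosystem} {year}: {share:.1%}")
```

Because the labels come from automatic inference, a figure computed this way is meaningful only in aggregate, which matches the limitation stated in the abstract.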
Pages: 180-187
Number of pages: 8