An analysis of design process and performance in distributed data science teams

被引:10
|
作者
Maier, Torsten [1 ]
DeFranco, Joanna [2 ]
Mccomb, Christopher [3 ]
机构
[1] Penn State Univ, University Pk, PA 16802 USA
[2] Penn State Univ, Software Engn, University Pk, PA 16802 USA
[3] Penn State Univ, Engn, Main Campus, University Pk, PA 16802 USA
关键词
Teamwork; Data science; Distributed teams; Global teamwork; Kaggle data set; Software engineering teams; Technical teams; SOCIAL DILEMMAS; COMMUNICATION; COOPERATION; SIZE;
D O I
10.1108/TPM-03-2019-0024
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Purpose Often, it is assumed that teams are better at solving problems than individuals working independently. However, recent work in engineering, design and psychology contradicts this assumption. This study aims to examine the behavior of teams engaged in data science competitions. Crowdsourced competitions have seen increased use for software development and data science, and platforms often encourage teamwork between participants. Design/methodology/approach We specifically examine the teams participating in data science competitions hosted by Kaggle. We analyze the data provided by Kaggle to compare the effect of team size and interaction frequency on team performance. We also contextualize these results through a semantic analysis. Findings This work demonstrates that groups of individuals working independently may outperform interacting teams on average, but that small, interacting teams are more likely to win competitions. The semantic analysis revealed differences in forum participation, verb usage and pronoun usage when comparing top- and bottom-performing teams. Research limitations/implications - These results reveal a perplexing tension that must be explored further: true teams may experience better performance with higher cohesion, but nominal teams may perform even better on average with essentially no cohesion. Limitations of this research include not factoring in team member experience level and reliance on extant data. Originality/value These results are potentially of use to designers of crowdsourced data science competitions as well as managers and contributors to distributed software development projects.
引用
收藏
页码:419 / 439
页数:21
相关论文
共 50 条
  • [31] PERFORMANCE ANALYSIS OF DISTRIBUTED DATA-BASE SYSTEMS
    STONEBRAKER, M
    WOODFILL, J
    RANSTROM, J
    KALASH, J
    ARNOLD, K
    ANDERSEN, E
    PERFORMANCE EVALUATION, 1984, 4 (03) : 220 - 220
  • [32] Assessing the work of geographically distributed teams in engineering-design: Time allocation in the design process as a form of in-class analytics
    Miranda, Constanza
    Goñi, Julian Iñaki
    Hilliger, Isabel
    Lugo, José
    International Journal of Engineering Education, 2020, 36 (1 B) : 399 - 410
  • [33] PERFORMANCE ANALYSIS OF DISTRIBUTED DATA BASE SYSTEMS.
    Stonebraker, Michael
    Woodfill, John
    Ranstrom, Jeff
    Murphy, Marguerite
    Kalash, Joseph
    Carey, Michael
    Arnold, Kenneth
    Database Engineering, 1982, 5 (04): : 58 - 65
  • [34] Assessing the Work of Geographically Distributed Teams in Engineering-Design: Time Allocation in the Design Process as a Form of In-Class Analytics
    Miranda, Constanza
    Goni, Julian Inani
    Hilliger, Isabel
    Lugo, Jose
    INTERNATIONAL JOURNAL OF ENGINEERING EDUCATION, 2020, 36 (01) : 399 - 410
  • [35] Performance Analysis for Distributed Fusion with Different Dimensional Data
    Yuan, Xianghui
    Duan, Zhansheng
    Han, Chongzhao
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [36] Interdisciplinary teacher teams: Context, design, and process
    Crow, GM
    Pounder, DG
    EDUCATIONAL ADMINISTRATION QUARTERLY, 2000, 36 (02) : 216 - 254
  • [37] Improving Performance for Geo-Distributed Data Process in Wide -Area
    Zhang, Ge
    Wang, Haozhan
    Luan, Zhongzhi
    Wu, Weiguo
    Qian, Depei
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2017, : 162 - 167
  • [38] Collaborative and distributed design process analysis for intelligent agents implementation
    Ostrosi, E
    Ferney, M
    CONCURRENT ENGINEERING: THE WORLDWIDE ENGINEERING GRID, PROCEEDINGS, 2004, : 215 - 222
  • [39] Process and Performance in Human- Robot Teams
    Lewis, Michael
    Wang, Huadong
    Chien, Shih Yi
    Velagapudi, Prasanna
    Scerri, Paul
    Sycara, Katia
    JOURNAL OF COGNITIVE ENGINEERING AND DECISION MAKING, 2011, 5 (02) : 186 - 208
  • [40] Introduction to the Emerging Cognitive Science of Distributed Human-Autonomy Teams
    Myers, Christopher W.
    Cooke, Nancy J.
    Gorman, Jamie C.
    McNeese, Nathan J.
    TOPICS IN COGNITIVE SCIENCE, 2024, 16 (03) : 377 - 390