The Limits of Empiricism: A Critique of Data-Driven Theory Development

被引:0
|
作者
Van Slyke, Craig [1 ]
Kamis, Arnold [2 ]
机构
[1] Louisiana Tech Univ, Coll Business, Ruston, LA 71272 USA
[2] Brandeis Univ, Data Analyt, Waltham, MA USA
来源
关键词
Information Security; Empiricism; Surveys; Research Methods; INFORMATION-SYSTEMS RESEARCH; COMMON METHOD VARIANCE; GROUNDED THEORY; CONCEPT DRIFT; DATA QUALITY; KNOWLEDGE; STATE;
D O I
10.1145/3663682.3663689
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The abundance of data available to researchers has led to increasing interest in data-derived theoretical development. Although this is a valid method of deriving theoretical models, it is subject to numerous limitations and hazards that may threaten the validity and usefulness of the models. The purpose of this paper is to critique empirically driven theoretical development. Our goal is to offer a cautionary tale about the limits of derivation of theory from empirical analysis in the hopes that our analysis and critique can strengthen empirical derivation of theory. In this paper, we use the empirical derivation of the Unified Model of Information Security Policy Compliance (UMISPC) as a research case study to illustrate some of these limitations and risks. For example, we critique the opportunistic dropping of theoretical paths based on statistical results, cautioning that doing so is insufficient for forming new theory. We also report several attempts at validating UMISPC through replication, including our own, which used data from a survey of 525 employed American adults. Comparison of the replications and original model indicates a general failure to replicate substantial portions of the original paper. We discuss five specific pitfalls associated with empirically driven model development and make recommendations for future studies that use inductive, data-driven approaches to derive theoretical models.
引用
收藏
页码:119 / 145
页数:27
相关论文
共 50 条
  • [31] A Pipeline for Integrated Theory and Data-Driven Modeling of Biomedical Data
    Raghu, Vineet K.
    Ge, Xiaoyu
    Balajiee, Arun
    Shirer, Daniel J.
    Das, Isha
    Benos, Panayiotis, V
    Chrysanthis, Panos K.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (03) : 811 - 822
  • [32] Fundamental Limits of Data Utility: A Case Study for Data-Driven Identity Authentication
    Yang, Qing
    Wang, Cheng
    Wang, Changqi
    Teng, Hu
    Jiang, Changjun
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2021, 8 (02) : 398 - 409
  • [33] Development of Data-Driven System in Materials Integration
    Inoue, Junya
    Okada, Masato
    Nagao, Hiromichi
    Yokota, Hideo
    Adachi, Yoshitaka
    MATERIALS TRANSACTIONS, 2020, 61 (11) : 2058 - 2066
  • [34] Data-Driven Development and Evaluation of Enskill English
    Johnson, W. Lewis
    INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION, 2019, 29 (03) : 425 - 457
  • [35] Data-Driven Development and Evaluation of Enskill English
    W. Lewis Johnson
    International Journal of Artificial Intelligence in Education, 2019, 29 : 425 - 457
  • [36] Challenges of data-driven methods in product development
    Mehlstäubl J.
    Gadzo E.
    Atzberger A.
    Paetzold K.
    Konstruktion, 2022, 74 (06): : 60 - 66
  • [37] Data-driven inference of physical devices: theory and implementation
    Buscemi, Francesco
    Dall'Arno, Michele
    NEW JOURNAL OF PHYSICS, 2019, 21 (11):
  • [38] Data-driven finite element method: Theory and applications
    Amir Siddiq, M.
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2021, 235 (17) : 3329 - 3339
  • [39] Data-driven approximations in electronic structure theory.
    Janesko, BG
    Yaron, D
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2004, 228 : U253 - U253
  • [40] Data-Driven Fault Supervisory Control Theory and Applications
    Zhang, Huaguang
    Jiang, Bin
    Yu, Wen
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2013, 2013