Contextual Gaps in Machine Learning for Mental Illness Prediction: The Case of Diagnostic Disclosures

Authors
Chancellor S. [1]
Feuston J.L. [2]
Chang J. [3]
Affiliations
[1] University of Minnesota, Minneapolis, MN
[2] University of Colorado Boulder, Boulder, CO
[3] Northwestern University, Evanston, IL
Keywords
error analysis; mental health; Reddit; social media; validity
DOI
10.1145/3610181
Abstract
Obtaining training data for machine learning (ML) prediction of mental illness from social media data is labor intensive. To work around this, ML teams extrapolate proxy signals, or alternative signs from data, to evaluate illness status and create training datasets. However, it has not been determined whether these signals are valid, whether they align with important contextual factors, or how proxy quality impacts downstream model integrity. We use ML and qualitative methods to evaluate whether a popular proxy signal, diagnostic self-disclosure, produces a conceptually sound ML model of mental illness. Our findings identify major conceptual errors visible only through qualitative investigation: training data built from diagnostic disclosures encodes a narrow vision of diagnosis experiences that propagates into paradoxes in the downstream ML model. This gap is obscured by the strong performance of the ML classifier (F1 = 0.91). We discuss the implications of conceptual gaps in creating training data for human-centered models and make suggestions for improving research methods. © 2023 ACM.