Validating Machine Learning Algorithms for Twitter Data Against Established Measures of Suicidality

被引:106
作者
Braithwaite, Scott R. [1 ]
Giraud-Carrier, Christophe [2 ]
West, Josh [3 ]
Barnes, Michael D. [3 ]
Hanson, Carl Lee [3 ]
机构
[1] Brigham Young Univ, Dept Psychol, Computat Hlth Sci Res Grp, Provo, UT 84602 USA
[2] Brigham Young Univ, Dept Comp Sci, Computat Hlth Sci Res Grp, Provo, UT 84602 USA
[3] Brigham Young Univ, Dept Hlth Sci, Computat Hlth Sci Res Grp, 4103B Life Sci Bldg, Provo, UT 84602 USA
关键词
suicide; social media; twitter; machine learning;
D O I
10.2196/mental.4822
中图分类号
R749 [精神病学];
学科分类号
100205 ;
摘要
Background: One of the leading causes of death in the United States (US) is suicide and new methods of assessment are needed to track its risk in real time. Objective: Our objective is to validate the use of machine learning algorithms for Twitter data against empirically validated measures of suicidality in the US population. Methods: Using a machine learning algorithm, the Twitter feeds of 135 Mechanical Turk (MTurk) participants were compared with validated, self-report measures of suicide risk. Results: Our findings show that people who are at high suicidal risk can be easily differentiated from those who are not by machine learning algorithms, which accurately identify the clinically significant suicidal rate in 92% of cases (sensitivity: 53%, specificity: 97%, positive predictive value: 75%, negative predictive value: 93%). Conclusions: Machine learning algorithms are efficient in differentiating people who are at a suicidal risk from those who are not. Evidence for suicidality can be measured in nonclinical populations using social media data.
引用
收藏
页数:10
相关论文
共 59 条
[1]   The Reliability of Tweets as a Supplementary Method of Seasonal Influenza Surveillance [J].
Aslam, Anoshe A. ;
Tsou, Ming-Hsiang ;
Spitzberg, Brian H. ;
An, Li ;
Gawron, J. Mark ;
Gupta, Dipak K. ;
Peddecord, K. Michael ;
Nagel, Anna C. ;
Allen, Christopher ;
Yang, Jiue-An ;
Lindsay, Suzanne .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2014, 16 (11)
[2]   SUICIDE AS ESCAPE FROM SELF [J].
BAUMEISTER, RF .
PSYCHOLOGICAL REVIEW, 1990, 97 (01) :90-113
[3]   Psychometric properties of the Beck Self-Esteem Scales [J].
Beck, AT ;
Brown, GK ;
Steer, RA ;
Kuyken, W ;
Grisham, J .
BEHAVIOUR RESEARCH AND THERAPY, 2001, 39 (01) :115-124
[4]  
Breiman L., 1984, CLASSIFICATION REGRE, V1st, DOI 10.1201/9781315139470
[5]   Amazon's Mechanical Turk: A New Source of Inexpensive, Yet High-Quality, Data? [J].
Buhrmester, Michael ;
Kwang, Tracy ;
Gosling, Samuel D. .
PERSPECTIVES ON PSYCHOLOGICAL SCIENCE, 2011, 6 (01) :3-5
[6]  
Canetto SS, 2002, J PSYCHOL, V136, P573, DOI 10.1080/00223980209605552
[7]  
Centers for Disease Control and Prevention, 2015, FAT INJ DAT
[8]  
Clay R, MONITOR PSYCHOL SUIC
[9]   THE SCIENCE OF PREVENTION - A CONCEPTUAL-FRAMEWORK AND SOME DIRECTIONS FOR A NATIONAL RESEARCH-PROGRAM [J].
COIE, JD ;
WATT, NF ;
WEST, SG ;
HAWKINS, JD ;
ASARNOW, JR ;
MARKMAN, HJ ;
RAMEY, SL ;
SHURE, MB ;
LONG, B .
AMERICAN PSYCHOLOGIST, 1993, 48 (10) :1013-1022
[10]  
Coppersmith G., 2015, PROC 2 WORKSHOP COMP, P1, DOI 10.3115/v1/W15-1201