Triaging content severity in online mental health forums

被引:29
作者
Cohan, Arman [1 ]
Young, Sydney [1 ]
Yates, Andrew [2 ]
Goharian, Nazli [1 ]
机构
[1] Georgetown Univ, Dept Comp Sci, Informat Retrieval Lab, Washington, DC 20057 USA
[2] Max Planck Inst Informat, Saarland Informat Campus, Saarbrucken, Germany
基金
美国国家科学基金会;
关键词
EMOTION DETECTION; SUICIDE; SENTIMENT;
D O I
10.1002/asi.23865
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, social media has become a significant resource for improving healthcare and mental health. Mental health forums are online communities where people express their issues, and seek help from moderators and other users. In such forums, there are often posts with severe content indicating that the user is in acute distress and there is a risk of attempted self-harm. Moderators need to respond to these severe posts in a timely manner to prevent potential self-harm. However, the large volume of daily posted content makes it difficult for the moderators to locate and respond to these critical posts. We propose an approach for triaging user content into four severity categories that are defined based on an indication of self-harm ideation. Our models are based on a feature-rich classification framework, which includes lexical, psycholinguistic, contextual, and topic modeling features. Our approaches improve over the state of the art in triaging the content severity in mental health forums by large margins (up to 17% improvement over the F-1 scores). Furthermore, using our proposed model, we analyze the mental state of users and we show that overall, long-term users of the forum demonstrate decreased severity of risk over time. Our analysis on the interaction of the moderators with the users further indicates that without an automatic way to identify critical content, it is indeed challenging for the moderators to provide timely response to the users in need.
引用
收藏
页码:2675 / 2689
页数:15
相关论文
共 59 条
[1]  
American Foundation for Suicide Prevention, 2016, SUIC STAT 2016
[2]  
[Anonymous], 2016, P 3 WORKSH COMP LING, DOI [10.18653/v1/w16-0311, DOI 10.18653/V1/W16-0311]
[3]  
[Anonymous], 2016, PROC 3 WORKSHOP COMP, DOI 10.18653/v1/W16-0314
[4]   Probabilistic Topic Models [J].
Blei, David M. .
COMMUNICATIONS OF THE ACM, 2012, 55 (04) :77-84
[5]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[6]   Validating Machine Learning Algorithms for Twitter Data Against Established Measures of Suicidality [J].
Braithwaite, Scott R. ;
Giraud-Carrier, Christophe ;
West, Josh ;
Barnes, Michael D. ;
Hanson, Carl Lee .
JMIR MENTAL HEALTH, 2016, 3 (02)
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]  
Brew Chris, 2016, P 3 WORKSH COMP LING, P138, DOI [10.18653/v1/W16-0315, DOI 10.18653/V1/W16-0315]
[9]  
Burnap P., 2015, P 26 ACM C HYP SOC M, P75, DOI DOI 10.1145/2700171.2791023
[10]  
Centers for Disease Control and Prevention, 2015, SUIC FACT GLANC 2015