Turning user generated health-related content into actionable knowledge through text analytics services

被引:33
|
作者
Martinez, Paloma [1 ]
Martinez, Jose L. [3 ]
Segura-Bedmar, Isabel [2 ]
Moreno-Schneider, Julian [2 ]
Luna, Adrian [3 ]
Revert, Ricardo [2 ]
机构
[1] Univ Carlos III Madrid, Dept Comp Sci, Adv Databases Grp, E-28903 Getafe, Spain
[2] Univ Carlos III Madrid, Dept Comp Sci, E-28903 Getafe, Spain
[3] MeaningCloud LLC, San Francisco, CA USA
关键词
ADVERSE DRUG-REACTIONS; SOCIAL MEDIA; BIOMEDICAL LITERATURE; EXTRACTION; FDA;
D O I
10.1016/j.compind.2015.10.006
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In the last years, the habit of discussing healthcare issues with family and friends, even with unknown people, in the context of social networks has increased and processing user generated content has become a new challenge. This can help in on-line crowd surveillance for different applications (pharmacovigilance and filtering health contents in blogs among others) as well as extracting knowledge from unstructured text sources. In this article, a system that monitors health social media streams is described. It is based on several text analytics processes supported, among others, by MeaningCloud, a commercial platform which provides meaning extraction from texts in a Software as a Service mode. In this architecture, several domain resources are integrated to detect drugs and drug effects such as CIMA (official information about authorized drugs in Spain maintained by the Spanish Agency of Medicines and Health Products), MedDRA (Medical Dictionary for Regulatory Activities) and the SpanishDrugEffectDB database that contains relations between drugs and effects. Different ways of visualizing data considering time lines and aggregated data have been implemented. In order to show performance, an evaluation has been carried out over Named Entities Recognition (NER) and Relation Extraction (RE) tasks. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:43 / 56
页数:14
相关论文
共 32 条