Identifying Predictive Causal Factors from News Streams

被引:0
|
作者
Balashankar, Ananth [1 ]
Chakraborty, Sunandan [2 ]
Fraiberger, Samuel [1 ,3 ]
Subramanian, Lakshminarayanan [1 ]
机构
[1] NYU, Courant Inst Math Sci, New York, NY 10003 USA
[2] Indiana Univ Indianapolis, Sch Informat & Comp, Indianapolis, IN USA
[3] World Bank, 1818 H St NW, Washington, DC 20433 USA
关键词
SELECTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new framework to uncover the relationship between news events and real world phenomena. We present the Predictive Causal Graph (PCG) which allows to detect latent relationships between events mentioned in news streams. This graph is constructed by measuring how the occurrence of a word in the news influences the occurrence of another (set of) word(s) in the future. We show that PCG can be used to extract latent features from news streams, outperforming other graph-based methods in prediction error of 10 stock price time series for 12 months. We then extended PCG to be applicable for longer time windows by allowing time-varying factors, leading to stock price prediction error rates between 1.5% and 5% for about 4 years. We then manually validated PCG, finding that 67% of the causation semantic frame arguments present in the news corpus were directly connected in the PCG, the remaining being connected through a semantically relevant intermediate node.
引用
收藏
页码:2338 / 2348
页数:11
相关论文
共 50 条
  • [21] Predictive Factors for Identifying Patients With Inadequate Bowel Preparation
    Hovsepians, Rita
    Liu, Lin
    Yang, Michael
    Maples, Michelle
    Groessl, Eric J.
    Gupta, Samir
    Ho, Samuel B.
    GASTROINTESTINAL ENDOSCOPY, 2017, 85 (05) : AB179 - AB179
  • [22] Identifying predictive factors of adherence and persistence to rivastigmine patch
    Riepe, M.
    Weinman, J.
    Mueller, B.
    Brady, R.
    Strohmaier, C.
    EUROPEAN JOURNAL OF NEUROLOGY, 2012, 19 : 476 - 476
  • [23] Methodology for identifying activities from GPS data streams
    Usyukov, Vladimir
    8TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2017) AND THE 7TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT 2017), 2017, 109 : 10 - 17
  • [24] Theory matters for identifying a causal role for genetic factors in socioeconomic outcomes
    Durlauf, Steven N.
    Rustichini, Aldo
    BEHAVIORAL AND BRAIN SCIENCES, 2023, 46
  • [25] Theory matters for identifying a causal role for genetic factors in socioeconomic outcomes
    Durlauf, Steven N.
    Rustichini, Aldo
    BEHAVIORAL AND BRAIN SCIENCES, 2023, 46
  • [26] A Framework for Identifying Causal Factors of Delay in Nuclear Power Plant Projects
    Alsharif, Samer
    Karatas, Aslihan
    ICSDEC 2016 - INTEGRATING DATA SCIENCE, CONSTRUCTION AND SUSTAINABILITY, 2016, 145 : 1486 - 1492
  • [27] Event mining and timeliness analysis from heterogeneous news streams
    Mele, Ida
    Bahrainian, Seyed Ali
    Crestani, Fabio
    INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (03) : 969 - 993
  • [28] EXTRACTING SIGNALS FROM NEWS STREAMS FOR DISEASE OUTBREAK PREDICTION
    Chakraborty, Sunandan
    Subramanian, Lakshminarayanan
    2016 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2016, : 1300 - 1304
  • [29] Predictive factors for identifying macrolide responder in treating chronic rhinosinusitis
    Seresirikachorn, Kachorn
    Kerr, Stephen J.
    Aeumjaturapat, Songklot
    Chusakul, Supinda
    Kanjanaumporn, Jesada
    Wongpiyabovorn, Jongkonnee
    Snidvongs, Kornkiat
    RHINOLOGY, 2021, 59 (03) : 284 - 291
  • [30] Identifying Fake News from the Variables that Governs the Spread of Fake News.
    Dordevic, Milan
    Pourghomi, Pardis
    Safieddine, Fadi
    2020 15TH INTERNATIONAL WORKSHOP ON SEMANTIC AND SOCIAL MEDIA ADAPTATION AND PERSONALIZATION (SMAP 2020), 2020, : 135 - 140