The Causal News Corpus: Annotating Causal Relations in Event Sentences from News

被引:0
|
作者
Tan, Fiona Anting [1 ]
Hurriyetoglu, Ali [2 ]
Caselli, Tommaso [3 ]
Oostdijk, Nelleke [4 ]
Nomoto, Tadashi [5 ]
Hettiarachchi, Hansi [6 ]
Ameer, Iqra [7 ]
Uca, Onur [8 ]
Liza, Farhana Ferdousi [9 ]
Hu, Tiancheng [10 ]
机构
[1] Natl Univ Singapore, Inst Data Sci, Singapore, Singapore
[2] Koc Univ, Istanbul, Turkey
[3] Univ Groningen, Groningen, Netherlands
[4] Radboud Univ Nijmegen, Nijmegen, Netherlands
[5] Natl Inst Japanese Literature, Tokyo, Japan
[6] Birmingham City Univ, Birmingham, W Midlands, England
[7] Inst Politecn Nacl, Ctr Invest Comp, Mexico City, DF, Mexico
[8] Mersin Univ, Dept Sociol, Mersin, Turkey
[9] Univ East Anglia, Norwich, Norfolk, England
[10] Swiss Fed Inst Technol, Zurich, Switzerland
基金
新加坡国家研究基金会;
关键词
causality; event causality; text mining; natural language understanding;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Despite the importance of understanding causality, corpora addressing causal relations are limited. There is a discrepancy between existing annotation guidelines of event causality and conventional causality corpora that focus more on linguistics. Many guidelines restrict themselves to include only explicit relations or clause-based arguments. Therefore, we propose an annotation schema for event causality that addresses these concerns. We annotated 3,559 event sentences from protest event news with labels on whether it contains causal relations or not. Our corpus is known as the Causal News Corpus (CNC). A neural network built upon a state-of-the-art pre-trained language model performed well with 81.20% F1 score on test set, and 83.46% in 5-folds cross-validation. CNC is transferable across two external corpora: CausalTimeBank (CTB) and Penn Discourse Treebank (PDTB). Leveraging each of these external datasets for training, we achieved up to approximately 64% F1 on the CNC test set without additional fine-tuning. CNC also served as an effective training and pre-training dataset for the two external corpora. Lastly, we demonstrate the difficulty of our task to the layman in a crowd-sourced annotation exercise. Our annotated corpus is publicly available, providing a valuable resource for causal text mining researchers.
引用
收藏
页码:2298 / 2310
页数:13
相关论文
共 50 条
  • [31] Learning Discourse Relations from News Reports: An Event-driven Approach
    Reyes, J. A.
    Montes, A.
    IEEE LATIN AMERICA TRANSACTIONS, 2016, 14 (01) : 356 - 363
  • [32] Quotebank: A Corpus of Quotations from a Decade of News
    Vaucher, Timote
    Spitz, Andreas
    Catasta, Michele
    West, Robert
    WSDM '21: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2021, : 328 - 336
  • [33] Event Detection from News Articles
    Sayyadi, Hassan
    Sahraei, Alireza
    Abolhassani, Hassan
    ADVANCES IN COMPUTER SCIENCE AND ENGINEERING, 2008, 6 : 981 - 984
  • [34] Discovering the Causal Network of Terms from the Text Corpus
    Wang, Yue
    2015 IEEE 12TH INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING (ICEBE), 2015, : 1 - 6
  • [35] TCCM: Time and Content-Aware Causal Model for Unbiased News Recommendation
    Chen, Yewang
    Ye, Weiyao
    Xv, Guipeng
    Lin, Chen
    Zhu, Xiaomin
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 3778 - 3782
  • [36] Bias Mitigation for Evidence-aware Fake News Detection by Causal Intervention
    Wu, Junfei
    Liu, Qiang
    Xu, Weizhi
    Wu, Shu
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 2308 - 2313
  • [37] What Boosts Fake News Dissemination on Social Media? A Causal Inference View
    Li, Yichuan
    Lee, Kyumin
    Kordzadeh, Nima
    Guo, Ruocheng
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT IV, 2023, 13938 : 234 - 246
  • [38] Causal Intervention and Counterfactual Reasoning for Multi-modal Fake News Detection
    Chen, Ziwei
    Hu, Linmei
    Li, Weixin
    Shao, Yingxia
    Nie, Liqiang
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 627 - 638
  • [39] Reacting to headline news: Circumstances leading to causal explanations versus implicational concerns
    Singh, Ramadhar
    Kaur, Susheel
    Junid, Fazlinda B.
    Self, William T.
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2011, 46 (01) : 63 - 70
  • [40] The Blame Game Elements of Causal Attribution and Its Impact on Siding With Agents in the News
    Knobloch-Westerwick, Silvia
    Taylor, Laramie D.
    COMMUNICATION RESEARCH, 2008, 35 (06) : 723 - 744