Cross-Language Fake News Detection

被引:0
|
作者
Chu S.K.W. [1 ]
Xie R. [1 ]
Wang Y. [1 ]
机构
[1] University of Hong Kong, Hong Kong
关键词
cross-language study; fake news detection; information detection;
D O I
10.2478/dim-2020-0025
中图分类号
学科分类号
摘要
With increasing globalization, news from different countries, and even in different languages, has become readily available and has become a way for many people to learn about other cultures. As people around the world become more reliant on social media, the impact of fake news on public society also increases. However, most of the fake news detection research focuses only on English. In this work, we compared the difference between textual features of different languages (Chinese and English) and their effect on detecting fake news. We also explored the cross-language transmissibility of fake news detection models. We found that Chinese textual features in fake news are more complex compared with English textual features. Our results also illustrated that the bidirectional encoder representations from transformers (BERT) model outperformed other algorithms for within-language data sets. As for detection in cross-language data sets, our findings demonstrated that fake news monitoring across languages is potentially feasible, while models trained with data from a more inclusive language would perform better in cross-language detection. © 2021 Samuel Kai Wah Chu et al., published by Sciendo
引用
收藏
页码:100 / 109
页数:9
相关论文
共 50 条
  • [1] Cross-Language Prominence Detection
    Rosenberg, Andrew
    Cooper, Erica
    Levitan, Rivka
    Hirschberg, Julia
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SPEECH PROSODY, VOLS I AND II, 2012, : 278 - 281
  • [2] Cross-language plagiarism detection
    Potthast, Martin
    Barron-Cedeno, Alberto
    Stein, Benno
    Rosso, Paolo
    LANGUAGE RESOURCES AND EVALUATION, 2011, 45 (01) : 45 - 62
  • [3] Cross-language plagiarism detection
    Martin Potthast
    Alberto Barrón-Cedeño
    Benno Stein
    Paolo Rosso
    Language Resources and Evaluation, 2011, 45 : 45 - 62
  • [4] Methods for cross-language plagiarism detection
    Barron-Cedeno, Alberto
    Gupta, Parth
    Rosso, Paolo
    KNOWLEDGE-BASED SYSTEMS, 2013, 50 : 211 - 217
  • [5] CROSS-LANGUAGE PHRASE BOUNDARY DETECTION
    Soto, Victor
    Cooper, Erica
    Rosenberg, Andrew
    Hirschberg, Julia
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8460 - 8464
  • [6] A Review of Fake News Detection Techniques for Arabic Language
    Alotaibi, Taghreed
    Al-Dossari, Hmood
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (01) : 392 - 407
  • [7] Fake news article detection datasets for Hindi language
    Kumar, Sujit
    Shankhdhar, Anant
    Singal, Divyam
    Aggarwal, Bhuvan
    Malhotra, Ahaan Sameer
    Ranbir Singh, Sanasam
    LANGUAGE RESOURCES AND EVALUATION, 2024,
  • [8] A Survey on Natural Language Processing for Fake News Detection
    Oshikawa, Ray
    Qian, Jing
    Wang, William Yang
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6086 - 6093
  • [9] Detection of fake news in a new corpus for the Spanish language
    Posadas-Duran, Juan-Pablo
    Gomez-Adorno, Helena
    Sidorov, Grigori
    Moreno Escobar, Jesus Jaime
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (05) : 4869 - 4876
  • [10] Leveraging Wikipedia Knowledge to Cross-Language Classify Textual News
    Antonio Mourino-Garcia, Marcos
    Perez-Rodriguez, Roberto
    Anido-Rifon, Luis
    2017 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI), 2017, : 164 - 168