TwitterBulletin: An Intelligent and Real-Time Automated News Categorization Tool for Twitter

被引:1
|
作者
Demirci, Sedef [1 ]
Sagiroglu, Seref [1 ]
机构
[1] Gazi Univ, Ankara, Turkey
关键词
News classification; artificial intelligence; deep learning; social media; Twitter; news topic modelling; news dataset; TEXT; JOURNALISM;
D O I
10.3897/jucs.69377
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Social media platforms have become popular news sources thanks to their immense popularity and high speed of information dissemination. Using these platforms is essential for news organizations and journalists to track and discover news in digital journalism age. However, the abundance of meaningless data and the lack of organization on these platforms make it difficult to reach valuable news for journalists. In this paper, we create the first public dataset containing large number of real-world Turkish news tweets belonging to different news categories, to the best of our knowledge. We propose an artificial intelligence-based two-step approach to assist journalists for accessing the news shared by various sources on social media under the relevant categories like politics (elections, riots, etc.), health (pandemic, covid-19, etc.), etc. via a single platform by reducing the possibility of overlooking needed information. In the first step, we propose a machine learning based novel model for collecting and categorizing news posts on social media. We implement several traditional machine learning and deep learning based algorithms and evaluate their classification performance in terms of accuracy, precision, recall, and F1 score. In the second step, we develop a software tool, named TwitterBulletin, which automatically retrieves Turkish news tweets and groups them under news categories in real time by using the CNN classifier which achieves the best performance in the first step. The results show that the overall accuracy rate of TwitterBulletin is reasonably high and satisfactory despite the challenge of classifying short tweets.
引用
收藏
页码:345 / 377
页数:33
相关论文
共 50 条
  • [41] TweetCred: Real-Time Credibility Assessment of Content on Twitter
    Gupta, Aditi
    Kumaraguru, Ponnurangam
    Castillo, Carlos
    Meier, Patrick
    SOCIAL INFORMATICS, SOCINFO 2014, 2014, 8851 : 228 - 243
  • [42] Real-time Event Detection in Twitter: A Case Study
    Sani, Ali Momen
    Moeini, Ali
    2020 6TH INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2020, : 48 - 51
  • [43] Real-time Traffic Classification with Twitter Data Mining
    Kurniawan, Dwi Aji
    Wibirama, Sunu
    Setiawan, Noor Akhmad
    PROCEEDINGS OF 2016 8TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND ELECTRICAL ENGINEERING (ICITEE), 2016,
  • [44] Scalable and Real-time Sentiment Analysis of Twitter Data
    Karanasou, Maria
    Ampla, Anneta
    Doulkeridis, Christos
    Halkidi, Maria
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 944 - 951
  • [45] Toward Real-Time Infoveillance of Twitter Health Messages
    Colditz, Jason B.
    Chu, Kar-Hai
    Emery, Sherry L.
    Larkin, Chandler R.
    James, A. Everette
    Welling, Joel
    Primack, Brian A.
    AMERICAN JOURNAL OF PUBLIC HEALTH, 2018, 108 (08) : 1009 - 1014
  • [46] Match the News: a Firefox Extension for Real-Time News Recommendation
    Karkali, Margarita
    Pontikis, Dimitris
    Vazirgiannis, Michalis
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 1117 - 1118
  • [47] Real-Time Data Harvesting Method for Czech Twitter
    Kral, Pavel
    Rajtmajer, Vaclav
    ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2017, : 259 - 265
  • [48] Real-Time Diffusion of Information on Twitter and the Financial Markets
    Tafti, Ali
    Zotti, Ryan
    Jank, Wolfgang
    PLOS ONE, 2016, 11 (08):
  • [49] Real-time Detection of Cyberbullying in Arabic Twitter Streams
    Mouheb, Djedjiga
    Abushamleh, Masa Hilal
    Abushamleh, Maya Hilal
    Al Aghbari, Zaher
    Kamel, Ibrahim
    2019 10TH IFIP INTERNATIONAL CONFERENCE ON NEW TECHNOLOGIES, MOBILITY AND SECURITY (NTMS), 2019,
  • [50] Real-time Automated Answer Scoring
    Nagaraj, Akash
    Sood, Mukund
    Srinivasa, Gowri
    2018 IEEE 18TH INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT 2018), 2018, : 231 - 232