Automatic free-text-tagging of online news archives

Cited by: 3
Authors
Farkas, Richard [1]
Berend, Gabor [1]
Hegedus, Istvan [1]
Karpati, Andras [1]
Krich, Balazs [1]
Affiliation
[1] Hungarian Acad Sci, H-1051 Budapest, Hungary
DOI
10.3233/978-1-60750-606-5-529
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we introduce the problem of free-text-tagging of online news archives. From an application point of view, it offers many benefits for online news portals; at the same time, the task has unique characteristics compared to existing approaches to free-text-tagging. We describe our system, which was developed for the archive (consisting of 370 thousand articles) of the most visited Hungarian news portal, www.origo.hu, along with the research questions encountered and solved during the work. Because evaluating tagging is not straightforward, at the end of the project the news company manually reviewed the tags produced by the automatic system, which yielded an F-measure of 71.9.
Pages: 529 - 534
Page count: 6