Multilingual Topic Classification in X: Dataset and Analysis

被引:0
|
作者
Antypas, Dimosthenis [1 ]
Ushio, Asahi [2 ]
Barbieri, Francesco [3 ]
Camacho-Collados, Jose [1 ]
机构
[1] Cardiff NLP, Cardiff University, United Kingdom
[2] Amazon, Tokyo, Japan
[3] Snap Inc., Santa Monica,CA, United States
来源
EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference | 2024年
关键词
'current - Computational scientists - Linguistic analysis - Media content - Multilingual analysis - Online dialog - Social media - Topic Classification - Topic Modeling - Traditional techniques;
D O I
暂无
中图分类号
学科分类号
摘要
50
引用
收藏
页码:20136 / 20152
相关论文
共 50 条
  • [41] A Multilingual Evaluation Dataset for MonolingualWord Sense Alignment
    Ahmadi, Sina
    McCrae, John P.
    Nimb, Sanni
    Khan, Fahad
    Monachini, Monica
    Pedersen, Bolette S.
    Declerck, Thierry
    Wissik, Tanja
    Bellandi, Andrea
    Pisani, Irene
    Troelsgard, Thomas
    Olsen, Sussi
    Krek, Simon
    Lipp, Veronika
    Varadi, Tamas
    Simon, Laszlo
    Gyorffy, Andras
    Tiberius, Carole
    Schoonheim, Tanneke
    Ben Moshe, Yifat
    Rudich, Maya
    Abu Ahmad, Raya
    Lonke, Dorielle
    Kovalenko, Kira
    Langemets, Margit
    Kallas, Jelena
    Dereza, Oksana
    Fransen, Theodorus
    Cillessen, David
    Lindemann, David
    Alonso, Mikel
    Salgado, Ana
    Sancho, Jose Luis
    Urena-Ruiz, Rafael-J
    Porta Zamorano, Jordi
    Simov, Kiril
    Osenova, Petya
    Kancheva, Zara
    Radev, Ivaylo
    Stankovic, Ranka
    Perdih, Andrej
    Gabrovsek, Dejan
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3232 - 3242
  • [42] Building a Dataset of Multilingual Cognates for the Romanian Lexicon
    Ciobanu, Alina Maria
    Dinu, Liviu P.
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1038 - 1043
  • [43] Multilingual Entity and Relation Extraction Dataset and Model
    Seganti, Alessandro
    Firlag, Klaudia
    Skowronska, Helena
    Satlawa, Michal
    Andruszkiewicz, Piotr
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1946 - 1955
  • [44] EUROPA: A Legal Multilingual Keyphrase Generation Dataset
    Salaun, Olivier
    Piedboeuf, Frederic
    Le Berre, Guillaume
    Hermelo, David Alfonso
    Langlais, Philippe
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 12718 - 12736
  • [45] REDFM: a Filtered and Multilingual Relation Extraction Dataset
    Cabot, Pere-Lluis Huguet
    Tedeschi, Simone
    Ngomo, Axel-Cyrille Ngonga
    Navigli, Roberto
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 4326 - 4343
  • [46] VoxEL: A Benchmark Dataset for Multilingual Entity Linking
    Rosales-Mendez, Henry
    Hogan, Aidan
    Poblete, Barbara
    SEMANTIC WEB - ISWC 2018, PT II, 2018, 11137 : 170 - 186
  • [47] Multitask Sentiment Analysis and Topic Classification Using BERT
    Shah, Parita
    Patel, Hiren
    Swaminarayan, Priya
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2025, 12 (01):
  • [48] Probabilistic topic modeling for the analysis and classification of genomic sequences
    Massimo La Rosa
    Antonino Fiannaca
    Riccardo Rizzo
    Alfonso Urso
    BMC Bioinformatics, 16
  • [49] Probabilistic topic modeling for the analysis and classification of genomic sequences
    La Rosa, Massimo
    Fiannaca, Antonino
    Rizzo, Riccardo
    Urso, Alfonso
    BMC BIOINFORMATICS, 2015, 16
  • [50] Improving the Efficiency and Effectiveness of Multilingual Classification Methods for Sentiment Analysis
    Ferdosian, Pantea
    Grace, Sean
    Manikandan, Vasudha
    Moles, Lucas
    Datta, Debajyoti
    Brown, Donald
    2021 SYSTEMS AND INFORMATION ENGINEERING DESIGN SYMPOSIUM (IEEE SIEDS 2021), 2021, : 176 - 179