Multilingual Topic Classification in X: Dataset and Analysis

Cited by: 0
Authors
Antypas, Dimosthenis [1]
Ushio, Asahi [2]
Barbieri, Francesco [3]
Camacho-Collados, Jose [1]
Affiliations
[1] Cardiff NLP, Cardiff University, United Kingdom
[2] Amazon, Tokyo, Japan
[3] Snap Inc., Santa Monica, CA, United States
Source
EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference | 2024
Keywords
'current - Computational scientists - Linguistic analysis - Media content - Multilingual analysis - Online dialog - Social media - Topic Classification - Topic Modeling - Traditional techniques
DOI
Not available
Pages: 20136-20152