Multilingual Topic Classification in X: Dataset and Analysis

Authors
Antypas, Dimosthenis [1]
Ushio, Asahi [2]
Barbieri, Francesco [3]
Camacho-Collados, Jose [1]
Affiliations
[1] Cardiff NLP, Cardiff University, United Kingdom
[2] Amazon, Tokyo, Japan
[3] Snap Inc., Santa Monica, CA, United States
Source
EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference | 2024
Keywords
'current - Computational scientists - Linguistic analysis - Media content - Multilingual analysis - Online dialog - Social media - Topic Classification - Topic Modeling - Traditional techniques;
DOI
Not available
Pages: 20136 - 20152
Related papers
  • [31] A Multilingual Handwritten Character Dataset: T-H-E Dataset
    Bartos, Gaye Ediboglu
    Hoscan, Yasar
    Kauer, Andras
    Hajnal, Eva
    ACTA POLYTECHNICA HUNGARICA, 2020, 17 (09) : 141 - 160
  • [32] Performance Analysis of Classification Algorithms on Birth Dataset
    Rehman, Aqeel Ur (rehmancqu@gmail.com), 1600, Institute of Electrical and Electronics Engineers Inc., United States (08):
  • [33] Multilingual Topic Models for Bilingual Dictionary Extraction
    Liu, Xiaodong
    Duh, Kevin
    Matsumoto, Yuji
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2015, 14 (03)
  • [34] A New Dataset for Topic-Based Paragraph Classification in Genocide-Related Court Transcripts
    Schirmer, Miriam
    Kruschwitz, Udo
    Donabauer, Gregor
    LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 4504 - 4512
  • [35] Multilingual Documentation and Classification
    Donnelly, Kevin
    EHEALTH: COMBINING HEALTH TELEMATICS, TELEMEDICINE, BIOMEDICAL ENGINEERING AND BIOINFORMATICS TO THE EDGE: GLOBAL EXPERTS SUMMIT TEXTBOOK, 2008, 134 : 235 - 243
  • [36] TOPIC MODELING FOR USER FEEDBACK DATASET
    Pangastuti, Sinta Septi
    Rohmatullayaly, Eneng Nunuz
    Najmi, Nuroh
    COMMUNICATIONS IN MATHEMATICAL BIOLOGY AND NEUROSCIENCE, 2025,
  • [37] An Annotated Multilingual Dataset to Study Modality in the Gospels
    Bermudez-Sabel, Helena
    Dell'Oro, Francesca
    DIGITAL HUMANITIES QUARTERLY, 2024, 18 (01) : 1 - 16
  • [38] SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
    Clark, Elizabeth
    Rijhwani, Shruti
    Gehrmann, Sebastian
    Maynez, Joshua
    Aharoni, Roee
    Nikolaev, Vitaly
    Sellam, Thibault
    Siddhant, Aditya
    Das, Dipanjan
    Parikh, Ankur P.
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 9397 - 9413
  • [39] A new dataset for French and multilingual keyphrase generation
    Piedboeuf, Frederic
    Langlais, Philippe
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [40] XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
    Ponti, Edoardo M.
    Glavas, Goran
    Majewska, Olga
    Liu, Qianchu
    Vulic, Ivan
    Korhonen, Anna
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2362 - 2376