An Ensemble Approach to Cross-Domain Authorship Attribution

Cited by: 7
Authors
Custodio, Jose Eleandro [1]
Paraboni, Ivandre [1]
Affiliations
[1] Univ Sao Paulo, Sch Arts Sci & Humanities EACH, Sao Paulo, Brazil
Funding
São Paulo Research Foundation (FAPESP), Brazil
DOI
10.1007/978-3-030-28577-7_17
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents an ensemble approach to cross-domain authorship attribution that combines the predictions of three independent classifiers based on standard character n-grams, character n-grams with non-diacritic distortion, and word n-grams. Our proposal relies on variable-length n-gram models and multinomial logistic regression, and selects the highest-probability prediction among the three models as the output for the task. The approach is compared against a number of baseline systems, and we report results on both the PAN-CLEF 2018 test data and a new corpus of song lyrics in English and Portuguese.
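The selection rule described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the n-gram length ranges, and the probability values in the usage note are hypothetical, and the `distort` function reflects only one assumed reading of "non-diacritic distortion" (masking unaccented ASCII letters and digits while keeping diacritics, punctuation, and spaces). The per-model author probabilities are taken as given; in the paper they come from multinomial logistic regression models, which are not reproduced here.

```python
from collections import Counter


def char_ngrams(text, n_min=1, n_max=3):
    """Count character n-grams of every length from n_min to n_max."""
    counts = Counter()
    for n in range(n_min, n_max + 1):
        for i in range(len(text) - n + 1):
            counts[text[i:i + n]] += 1
    return counts


def word_ngrams(text, n_min=1, n_max=2):
    """Count word n-grams over lowercased, whitespace-split tokens."""
    tokens = text.lower().split()
    counts = Counter()
    for n in range(n_min, n_max + 1):
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return counts


def distort(text):
    """Assumed distortion: mask ASCII letters and digits with '*'.

    Accented characters, punctuation, and spaces are kept unchanged.
    """
    return "".join("*" if (c.isascii() and c.isalnum()) else c for c in text)


def select_prediction(model_probs):
    """Return (author, model, probability) for the single most confident
    prediction across all models.

    model_probs maps a model name to a dict of author -> probability.
    """
    best = (None, None, -1.0)
    for model, probs in model_probs.items():
        author, p = max(probs.items(), key=lambda kv: kv[1])
        if p > best[2]:
            best = (author, model, p)
    return best
```

For example, with hypothetical outputs `{"char": {"A": 0.6, "B": 0.4}, "dist": {"A": 0.2, "B": 0.8}, "word": {"A": 0.55, "B": 0.45}}`, `select_prediction` picks author `B` from the distortion model, because 0.8 is the highest probability any model assigns. Unlike majority voting, this rule lets one highly confident model override two weakly confident ones.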
Pages: 201-212
Number of pages: 12