An Ensemble Approach to Cross-Domain Authorship Attribution

被引:7
|
作者
Custodio, Jose Eleandro [1 ]
Paraboni, Ivandre [1 ]
机构
[1] Univ Sao Paulo, Sch Arts Sci & Humanities EACH, Sao Paulo, Brazil
基金
巴西圣保罗研究基金会;
关键词
D O I
10.1007/978-3-030-28577-7_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an ensemble approach to cross-domain authorship attribution that combines predictions made by three independent classifiers, namely, standard character n-grams, character n-grams with non-diacritic distortion and word n-grams. Our proposal relies on variable-length n-gram models and multinomial logistic regression to select the prediction of highest probability among the three models as the output for the task. The present approach is compared against a number of baseline systems, and we report results based on both the PAN-CLEF 2018 test data, and on a new corpus of song lyrics in English and Portuguese.
引用
收藏
页码:201 / 212
页数:12
相关论文
共 50 条
  • [1] A transfer learning approach to cross-domain authorship attribution
    Barlas, Georgios
    Stamatatos, Efstathios
    EVOLVING SYSTEMS, 2021, 12 (03) : 625 - 643
  • [2] A transfer learning approach to cross-domain authorship attribution
    Georgios Barlas
    Efstathios Stamatatos
    Evolving Systems, 2021, 12 : 625 - 643
  • [3] Cross-Domain Authorship Attribution Using Pre-trained Language Models
    Barlas, Georgios
    Stamatatos, Efstathios
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2020, PT I, 2020, 583 : 255 - 266
  • [4] An Ensemble Approach for Dutch Cross-Domain Hate Speech Detection
    Markov, Ilia
    Gevers, Ine
    Daelemans, Walter
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2022), 2022, 13286 : 3 - 15
  • [5] Cross-domain Ensemble Distillation for Domain Generalization
    Lee, Kyungmoon
    Kim, Sungyeon
    Kwak, Suha
    COMPUTER VISION, ECCV 2022, PT XXV, 2022, 13685 : 1 - 20
  • [6] Overview of PAN 2019: Bots and Gender Profiling, Celebrity Profiling, Cross-Domain Authorship Attribution and Style Change Detection
    Daelemans, Walter
    Kestemont, Mike
    Manjavacas, Enrique
    Potthast, Martin
    Rangel, Francisco
    Rosso, Paolo
    Specht, Guenther
    Stamatatos, Efstathios
    Stein, Benno
    Tschuggnall, Michael
    Wiegmann, Matti
    Zangerle, Eva
    EXPERIMENTAL IR MEETS MULTILINGUALITY, MULTIMODALITY, AND INTERACTION (CLEF 2019), 2019, 11696 : 402 - 416
  • [7] An Ensemble Model for Cross-Domain Polarity Classification on Twitter
    Tsakalidis, Adam
    Papadopoulos, Symeon
    Kompatsiaris, Ioannis
    WEB INFORMATION SYSTEMS ENGINEERING, PT II, 2014, 8787 : 168 - 177
  • [8] An Approach for Cross-Domain Intrusion Detection
    Thuy Nguyen
    Gondree, Mark
    Khosalim, Jean
    Shifflett, David
    Levin, Timothy
    Irvine, Cynthia
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INFORMATION WARFARE AND SECURITY, 2012, : 203 - 212
  • [9] A New Approach for Authorship Attribution
    Reddy, P. Buddha
    Reddy, T. Raghunadha
    Chand, M. Gopi
    Venkannababu, A.
    INFORMATION AND DECISION SCIENCES, 2018, 701 : 1 - 9
  • [10] Cross-Domain Recommendation: An Embedding and Mapping Approach
    Man, Tong
    Shen, Huawei
    Jin, Xiaolong
    Cheng, Xueqi
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2464 - 2470