Gender screening on question-answering communities

Cited by: 2
Authors
Figueroa, Alejandro [1]
Peralta, Billy [1]
Nicolis, Orietta [1]
Affiliations
[1] Univ Andres Bello, Fac Ingn, Dept Ciencias Ingn, Antonio Varas 880, Santiago, Chile
Keywords
Gender recognition; Community question answering; Pre-trained models; Deep neural networks; Statistical methods; Expert systems; Demographics; Networks
DOI
10.1016/j.eswa.2022.119405
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Modern community Question Answering (cQA) platforms encourage their members to publish any sort of question, which later on can get numerous answers from other community peers. However, in this dynamic, there is an intrinsic delay from the moment questions are posted until the arrival of acceptable and/or diverse responses. Therefore, cQA platforms have the pressing need for promoting unresolved questions to potential answerers, while also reducing gender disparity across their topics, for example. Needless to say, demographic analysis occupies a crucial role in successfully responding to these challenges.
Nonetheless, there are only a handful of studies dissecting automatic gender recognition across cQA fellows. As far as we know, this work is the first effort to tease out the contribution to this task of the different kinds of textual inputs contained in their profiles (i.e., question titles and bodies, answers and self-descriptions). With this goal, we compare three different types of machine learning approaches under several combinations of these four input signals: traditional neural networks (e.g., RCNN and CNN), fine-tuned pre-trained transformers (e.g., BERT and RoBERTa) and statistical methods enriched with hand-crafted linguistic features (e.g., Bayes and MaxEnt).
In a nutshell, our results show that pre-trained transformers are superior when dealing with full questions, conventional neural networks when mixing diverse text signals, and statistical methods when the dataset encompasses mostly noisy user-generated content, namely answers. In addition, our in-depth analysis reveals that dependency parsing is instrumental in designing hand-crafted features capable of modelling topic information, and that both genders are conspicuously represented by some specific topic distributions.
Pages: 17