Gender screening on question-answering communities

被引:2
|
作者
Figueroa, Alejandro [1 ]
Peralta, Billy [1 ]
Nicolis, Orietta [1 ]
机构
[1] Univ Andres Bello, Fac Ingn, Dept Ciencias Ingn, Antonio Varas 880, Santiago, Chile
关键词
Gender recognition; Community question answering; Pre-trained models; Deep neural networks; Statistical methods; Expert systems; DEMOGRAPHICS; NETWORKS;
D O I
10.1016/j.eswa.2022.119405
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern community Question Answering (cQA) platforms encourage their members to publish any sort of question, which later on can get numerous answers from other community peers. However, in this dynamic, there is an intrinsic delay from the moment questions are posted until the arrival of acceptable and/or diverse responses. Therefore, cQA platforms have the pressing need for promoting unresolved questions to potential answerers, while also reducing gender disparity across their topics, for example. Needless to say, demographic analysis occupies a crucial role in successfully responding to these challenges.Nonetheless, there are only a handful of studies dissecting automatic gender recognition across cQA fellows. As far as we know, this work is the first effort to tease out the contribution to this task of the different kinds of textual inputs contained in their profiles (i.e., question titles and bodies, answers and self-descriptions). With this goal, we compare three different types of machine learning approaches under several combinations of these four input signals: traditional neural networks (e.g., RCNN and CNN), fine-tuned pre-trained transformers (e.g., BERT and RoBERTa) and statistical methods enriched with hand-crafted linguistic features (e.g., Bayes and MaxEnt).In a nutshell, our results show that pre-trained transformers are superior when dealing with full questions, conventional neural networks when mixing diverse text signals, and statistical methods when the dataset encompasses mostly noisy user-generated content, namely answers. In addition, our in-depth analysis reveals that dependency parsing is instrumental in designing hand-crafted features capable of modelling topic information, and that both genders are conspicuously represented by some specific topic distributions.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Refining fine-tuned transformers with hand-crafted features for gender screening on question-answering communities
    Figueroa, Alejandro
    INFORMATION FUSION, 2023, 92 : 256 - 267
  • [2] Recommending QA Documents for Communities of Question-Answering Websites
    Liu, Duen-Ren
    Huang, Chun-Kai
    Chen, Yu-Hsuan
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT II, 2013, 7803 : 139 - 147
  • [3] QA document recommendations for communities of question-answering websites
    Liu, Duen-Ren
    Chen, Yu-Hsuan
    Huang, Chun-Kai
    KNOWLEDGE-BASED SYSTEMS, 2014, 57 : 146 - 160
  • [4] Question-answering system
    Stupina, A. A.
    Zhukov, E. A.
    Ezhemanskaya, S. N.
    Karaseva, M. V.
    Korpacheva, L. N.
    XII INTERNATIONAL SCIENTIFIC AND RESEARCH CONFERENCE TOPICAL ISSUES IN AERONAUTICS AND ASTRONAUTICS, 2016, 155
  • [5] Discovering Knowledge-Sharing Communities in Question-Answering Forums
    Bouguessa, Mohamed
    Wang, Shengrui
    Dumoulin, Benoit
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 5 (01)
  • [6] Chinese question-answering system
    Huang, GT
    Yao, HH
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2004, 19 (04) : 479 - 488
  • [7] Chinese question-answering system
    Gai-Tai Huang
    Hsiu-Hsen Yao
    Journal of Computer Science and Technology, 2004, 19
  • [8] Answer formulation for question-answering
    Kosseim, L
    Plamondon, L
    Guillemette, LJ
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2671 : 24 - 34
  • [9] QUESTION-ANSWERING STRATEGIES FOR CHILDREN
    RAPHAEL, TE
    READING TEACHER, 1982, 36 (02): : 186 - 190
  • [10] Neural age screening on question answering communities
    Timilsina, Mohan
    Figueroa, Alejandro
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123