Gender screening on question-answering communities

被引：2

作者：

Figueroa, Alejandro ^{[1
]}

Peralta, Billy ^{[1
]}

Nicolis, Orietta ^{[1
]}

机构：

[1] Univ Andres Bello, Fac Ingn, Dept Ciencias Ingn, Antonio Varas 880, Santiago, Chile

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2023年 / 215卷

关键词：

Gender recognition; Community question answering; Pre-trained models; Deep neural networks; Statistical methods; Expert systems; DEMOGRAPHICS; NETWORKS;

D O I：

10.1016/j.eswa.2022.119405

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Modern community Question Answering (cQA) platforms encourage their members to publish any sort of question, which later on can get numerous answers from other community peers. However, in this dynamic, there is an intrinsic delay from the moment questions are posted until the arrival of acceptable and/or diverse responses. Therefore, cQA platforms have the pressing need for promoting unresolved questions to potential answerers, while also reducing gender disparity across their topics, for example. Needless to say, demographic analysis occupies a crucial role in successfully responding to these challenges.Nonetheless, there are only a handful of studies dissecting automatic gender recognition across cQA fellows. As far as we know, this work is the first effort to tease out the contribution to this task of the different kinds of textual inputs contained in their profiles (i.e., question titles and bodies, answers and self-descriptions). With this goal, we compare three different types of machine learning approaches under several combinations of these four input signals: traditional neural networks (e.g., RCNN and CNN), fine-tuned pre-trained transformers (e.g., BERT and RoBERTa) and statistical methods enriched with hand-crafted linguistic features (e.g., Bayes and MaxEnt).In a nutshell, our results show that pre-trained transformers are superior when dealing with full questions, conventional neural networks when mixing diverse text signals, and statistical methods when the dataset encompasses mostly noisy user-generated content, namely answers. In addition, our in-depth analysis reveals that dependency parsing is instrumental in designing hand-crafted features capable of modelling topic information, and that both genders are conspicuously represented by some specific topic distributions.

引用

页数：17

共 50 条

[1] Refining fine-tuned transformers with hand-crafted features for gender screening on question-answering communities
Figueroa, Alejandro
INFORMATION FUSION, 2023, 92 : 256 - 267
[2] Recommending QA Documents for Communities of Question-Answering Websites
Liu, Duen-Ren
Huang, Chun-Kai
Chen, Yu-Hsuan
INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT II, 2013, 7803 : 139 - 147
[3] QA document recommendations for communities of question-answering websites
Liu, Duen-Ren
Chen, Yu-Hsuan
Huang, Chun-Kai
KNOWLEDGE-BASED SYSTEMS, 2014, 57 : 146 - 160
[4] Question-answering system
Stupina, A. A.
Zhukov, E. A.
Ezhemanskaya, S. N.
Karaseva, M. V.
Korpacheva, L. N.
XII INTERNATIONAL SCIENTIFIC AND RESEARCH CONFERENCE TOPICAL ISSUES IN AERONAUTICS AND ASTRONAUTICS, 2016, 155
[5] Discovering Knowledge-Sharing Communities in Question-Answering Forums
Bouguessa, Mohamed
Wang, Shengrui
Dumoulin, Benoit
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 5 (01)
[6] Chinese question-answering system
Huang, GT
Yao, HH
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2004, 19 (04) : 479 - 488
[7] Chinese question-answering system
Gai-Tai Huang
Hsiu-Hsen Yao
Journal of Computer Science and Technology, 2004, 19
[8] Answer formulation for question-answering
Kosseim, L
Plamondon, L
Guillemette, LJ
ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2671 : 24 - 34
[9] QUESTION-ANSWERING STRATEGIES FOR CHILDREN
RAPHAEL, TE
READING TEACHER, 1982, 36 (02): : 186 - 190
[10] Neural age screening on question answering communities
Timilsina, Mohan
Figueroa, Alejandro
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123

← 1 2 3 4 5 →