Gender classification of product reviewers in China: a data-driven approach

Cited by: 0
Authors
Wang, Jing [1]
Yan, Xiangbin [2]
Zhu, Bin [3]
Affiliations
[1] Commun Univ China, Sch Econ & Management, Dept Management Sci & Engn, 1 Dingfuzhuang East St, Beijing, Peoples R China
[2] Univ Sci & Technol Beijing, Donlinks Sch Econ & Management, Dept Management Sci & Engn, 30 Xueyuan Rd, Beijing, Peoples R China
[3] Oregon State Univ, Coll Business, Dept Business Informat Syst, 2751 SW Jefferson Way, Corvallis, OR USA
Funding
National Natural Science Foundation of China;
Keywords
Text mining; Gender classification; Chinese gender lexicon; Naïve Bayesian; BP neural network; Support vector machines; ONLINE; DISCOURSE; EMOTION; AUTHOR;
DOI
10.1007/s10799-024-00443-0
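Chinese Library Classification number
G25 [Library science, librarianship]; G35 [Information science, information work];
Subject classification code
1205; 120501;
Abstract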
Online product discussion forums have become essential resources for marketers seeking to understand market dynamics and consumer preferences. Identifying the gender of forum participants can further enhance the effectiveness and efficiency of marketing efforts. However, the relationship between linguistic features and gender classification often varies due to contextual factors such as genres, social networks, and social classes. Recognizing that the discriminatory power of gender markers changes with context, this study proposes and validates a framework to guide the adoption of existing gender classification systems specifically for online product discussions. We demonstrate that beyond optimizing the classification methods themselves, performance can be improved by strategically applying these methods to archived discussion data. Our findings reveal that, for a given classification method and discussion forum, the size of the input data significantly influences performance, and that an optimal data size exists at which performance is best.
Pages: 12