Space identification of sexual harassment reports with text mining

被引:4
|
作者
Karami A. [1 ]
Swan S. [2 ]
Moraes M.F. [3 ]
机构
[1] School of Information Science, University of South Carolina, Columbia, SC
[2] Department of Psychology, University of South Carolina, Columbia, SC
[3] Computer Science and Engineering Department, University of South Carolina, Columbia, SC
关键词
classification; sexual harassment; space; text mining;
D O I
10.1002/pra2.265
中图分类号
学科分类号
摘要
Sexual harassment is an invisible problem that has been difficult to combat because victims are often reluctant to report. However, within the past years, the sheer volume of women who have spoken up about sexual harassment has brought the issue to the forefront. This change has been largely driven, in part, by Internet and social media technologies. Given the large size of data posted on these online technologies, it is impossible to manually analyze and organize it; therefore, there is a need to utilize data and text mining methods. In order to help the fight against sexual harassment, this study proposes a predictive framework to collect more than 14,000 sexual harassment reports on the everyday sexism project (ESP) website and identify the space (location) in the reports. Our framework achieves 85.33% accuracy for seven space classes including workplace, public space, home, public transport, school, university, and media. This paper also enriches experiments by merging similar classes (e.g., school and university) and applies a feature selection method to reduce the number of features for efficiency and effectiveness purposes. This enrichment process offers promising results for different sets of classes and features, ranging from 86% – 93% accuracy. 83rd Annual Meeting of the Association for Information Science & Technology October 25-29, 2020. Author(s) retain copyright, but ASIS&T receives an exclusive publication license.
引用
收藏
相关论文
共 50 条
  • [1] A Systematic Literature Review of Sexual Harassment Studies with Text Mining
    Karami, Amir
    Spinel, Melek Yildiz
    White, C. Nicole
    Ford, Kayla
    Swan, Suzanne
    SUSTAINABILITY, 2021, 13 (12)
  • [2] Unwanted advances in higher education:Uncovering sexual harassment experiences in academia with text mining
    Karami, Amir
    White, Cynthia Nicole
    Ford, Kayla
    Swan, Suzanne
    Spinel, Melek Yildiz
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (02)
  • [3] Text Mining in Radiology Reports
    Kocatekin, Tugberk
    Unay, Devrim
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [4] Text Mining in Radiology Reports
    Gong, Tianxia
    Tan, Chew Lim
    Leong, Tze Yun
    Lee, Cheng Kiang
    Pang, Boon Chuan
    Lim, C. C. Tchoyoson
    Tian, Qi
    Tang, Suisheng
    Zhang, Zhuo
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 815 - +
  • [5] THE IDENTIFICATION AND CLASSIFICATION OF REACTIONS TO SEXUAL HARASSMENT
    TERPSTRA, DE
    BAKER, DD
    JOURNAL OF ORGANIZATIONAL BEHAVIOR, 1989, 10 (01) : 1 - 14
  • [6] Political Differences in American Reports of Sexual Harassment and Assault
    Jose, Rupa
    Fowler, James H.
    Raj, Anita
    JOURNAL OF INTERPERSONAL VIOLENCE, 2021, 36 (15-16) : 7695 - 7721
  • [7] Residents' and medical students' reports of sexual harassment and discrimination
    Baldwin, DC
    Daugherty, SR
    Rowley, BD
    ACADEMIC MEDICINE, 1996, 71 (10) : S25 - S27
  • [8] Linguistic Text Mining for Problem Reports
    Malin, Jane T.
    Throop, David R.
    Millward, Christopher
    Schwarz, Hansen A.
    Gomez, Fernando
    Thronesbery, Carroll
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 1578 - +
  • [9] Text mining brain imaging reports
    Beatrice Alex
    Claire Grover
    Richard Tobin
    Cathie Sudlow
    Grant Mair
    William Whiteley
    Journal of Biomedical Semantics, 10
  • [10] Text mining brain imaging reports
    Alex, Beatrice
    Grover, Claire
    Tobin, Richard
    Sudlow, Cathie
    Mair, Grant
    Whiteley, William
    JOURNAL OF BIOMEDICAL SEMANTICS, 2019, 10 (Suppl 1)