Improving Gender Identification in Movie Audio using Cross-Domain Data

被引:9
|
作者
Hebbar, Rajat [1 ]
Somandepalli, Krishna [1 ]
Narayanan, Shrikanth [1 ]
机构
[1] Univ Southern Calif, Signal Anal & Interpretat Lab, Dept Elect Engn, Los Angeles, CA 90007 USA
关键词
gender identification; voice activity detection; deep neural networks; recurrent neural networks; transfer learning; bi-directional long short-term memory; RECOGNITION; SPEECH;
D O I
10.21437/Interspeech.2018-1462
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gender identification from audio is an important task for quantitative gender analysis in multimedia, and to improve tasks like speech recognition. Robust gender identification requires speech segmentation that relies on accurate voice activity detection (VAD). These tasks are challenging in movie audio due to diverse and often noisy acoustic conditions. In this work, we acquire VAD labels for movie audio by aligning it with subtitle text, and train a recurrent neural network model for VAD. Subsequently, we apply transfer learning to predict gender using feature embeddings obtained from a model pre-trained for large-scale audio classification. In order to account for the diverse acoustic conditions in movie audio, we use audio clips from YouTube labeled for gender. We compare the performance of our proposed method with baseline experiments that were setup to assess the importance of feature embeddings and training data used for gender identification task. For systematic evaluation, we extend an existing benchmark dataset for movie VAD, to include precise gender labels. The VAD system shows comparable results to state-of-the-art in movie domain. The proposed gender identification system outperforms existing baselines, achieving an accuracy of 85% for movie audio. We have made the data and related code publicly available(1).
引用
收藏
页码:282 / 286
页数:5
相关论文
共 50 条
  • [21] Cross-domain Author Gender Classification in Brazilian Portuguese
    Sandroni Dias, Rafael Felipe
    Paraboni, Ivandre
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 1227 - 1234
  • [22] Improving Serendipity and Accuracy in Cross-Domain Recommender Systems
    Kotkov, Denis
    Wang, Shuaiqiang
    Veijalainen, Jari
    WEB INFORMATION SYSTEMS AND TECHNOLOGIES (WEBIST 2016), 2017, 292 : 105 - 119
  • [23] Cross-domain secure data sharing using blockchain for industrial IoT
    Singh, Parminder
    Masud, Mehedi
    Hossain, M. Shamim
    Kaur, Avinash
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021, 156 (156) : 176 - 184
  • [24] Cross-Domain Activity Recognition Using Shared Representation in Sensor Data
    Hamad, Rebeen Ali
    Yang, Longzhi
    Woo, Wai Lok
    Wei, Bo
    IEEE SENSORS JOURNAL, 2022, 22 (13) : 13273 - 13284
  • [25] Cross-Domain Neurobiology Data Integration and Exploration
    Xuan, Weijian
    Dai, Manhong
    Josh, Buckner
    Mirel, Barbara
    Song, Jean
    Athey, Brian
    Watson, Stanley J.
    Meng, Fan
    2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, : 37 - +
  • [26] Cross-domain neurobiology data integration and exploration
    Weijian Xuan
    Manhong Dai
    Josh Buckner
    Barbara Mirel
    Jean Song
    Brian Athey
    Stanley J Watson
    Fan Meng
    BMC Genomics, 11
  • [27] Modeling Treatment Effect with Cross-Domain Data
    Han, Bin
    Zhang, Ya-Lin
    Yu, Lu
    Chen, Biying
    Li, Longfei
    Zhou, Jun
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I, PAKDD 2024, 2024, 14645 : 365 - 377
  • [28] Cross-domain neurobiology data integration and exploration
    Xuan, Weijian
    Dai, Manhong
    Buckner, Josh
    Mirel, Barbara
    Song, Jean
    Athey, Brian
    Watson, Stanley J.
    Meng, Fan
    BMC GENOMICS, 2010, 11
  • [29] Data Poisoning Attacks on Cross-domain Recommendation
    Chen, Huiyuan
    Li, Jing
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2177 - 2180
  • [30] Cross-Domain Feature Extraction-Based Household Characteristics Identification Approach Using Smart Meter Data
    Yan, Siqing
    Li, Kangping
    Wang, Fei
    Ge, Xinxin
    Lu, Xiaoxing
    Chen, Hongyu
    Chang, Shengqiang
    2019 IEEE INDUSTRY APPLICATIONS SOCIETY ANNUAL MEETING, 2019,