Robust Self Supervised Speech Embeddings for Child-Adult Classification in Interactions involving Children with Autism

Cited by: 1
Authors
Lahiri, Rimita [1]
Feng, Tiantian [1]
Hebbar, Rajat [1]
Lord, Catherine [2]
Kim, So Hyun [3]
Narayanan, Shrikanth [1]
Affiliations
[1] Univ Southern Calif, Signal Anal & Interpretat Lab, Los Angeles, CA 90007 USA
[2] Univ Calif Los Angeles, Semel Inst Neurosci & Human Behav, Los Angeles, CA 90024 USA
[3] Korea Univ, Sch Psychol, Seoul, South Korea
Keywords
speech; child-adult classification; self-supervision; autism; SPECTRUM;
DOI
10.21437/Interspeech.2023-1447
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
We address the problem of detecting who spoke when in child-inclusive spoken interactions, i.e., automatic child-adult speaker classification. Interactions involving children are richly heterogeneous due to developmental differences. The presence of neurodiversity, e.g., due to Autism, contributes additional variability. We investigate the impact of additional pre-training with more unlabelled child speech on child-adult classification performance. We pre-train our model with child-inclusive interactions, following two recent self-supervision algorithms, Wav2vec 2.0 and WavLM, with a contrastive loss objective. We report a 9-13% relative improvement over the state-of-the-art baseline in classification F1 scores on two clinical interaction datasets involving children with Autism. We also analyze the impact of pre-training under different conditions by evaluating our model on interactions involving different subgroups of children based on various demographic factors.
Pages: 3557-3561
Page count: 5