SPICA: Interactive Video Content Exploration through Augmented Audio Descriptions for Blind or Low-Vision Viewers

Cited by: 3
Authors
Ning, Zheng [1]
Wimer, Brianna L. [1]
Jiang, Kaiwen [2]
Chen, Keyi [2]
Ban, Jerrick [1]
Tian, Yapeng [3]
Zhao, Yuhang [4]
Li, Toby Jia-Jun [1]
Affiliations
[1] Univ Notre Dame, Notre Dame, IN 46556 USA
[2] Univ Calif San Diego, La Jolla, CA USA
[3] Univ Texas Dallas, Richardson, TX USA
[4] Univ Wisconsin Madison, Madison, WI USA
Keywords
audio description; video consumption; accessibility; cognitive approach
DOI
10.1145/3613904.3642632
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Blind or Low-Vision (BLV) users often rely on audio descriptions (AD) to access video content. However, conventional static ADs can leave out detailed information in videos, impose a high mental load, neglect the diverse needs and preferences of BLV users, and lack immersion. To tackle these challenges, we introduce Spica, an AI-powered system that enables BLV users to interactively explore video content. Informed by prior empirical studies on BLV video consumption, Spica offers interactive mechanisms for supporting temporal navigation of frame captions and spatial exploration of objects within key frames. Leveraging an audio-visual machine learning pipeline, Spica augments existing ADs by adding interactivity, spatial sound effects, and individual object descriptions without requiring additional human annotation. Through a user study with 14 BLV participants, we evaluated the usability and usefulness of Spica and explored user behaviors, preferences, and mental models when interacting with augmented ADs.
Pages: 18