Joint Attention for Automated Video Editing

被引:2
|
作者
Wu, Hui-Yin [1 ]
Santarra, Trevor [2 ]
Leece, Michael [3 ]
Vargas, Rolando [3 ]
Jhala, Arnav [4 ]
机构
[1] Univ Cote dAzur, INRIA, Sophia Antipolis, France
[2] Unity Technol, San Francisco, CA USA
[3] Univ Calif Santa Cruz, Santa Cruz, CA USA
[4] North Carolina State Univ, Raleigh, NC USA
关键词
smart conferencing; automated video editing; joint attention; LSTM;
D O I
10.1145/3391614.3393656
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Joint attention refers to the shared focal points of attention for occupants in a space. In this work, we introduce a computational definition of joint attention for the automated editing of meetings in multi-camera environments from the AMI corpus. Using extracted head pose and individual headset amplitude as features, we developed three editing methods: (1) a naive audio-based method that selects the camera using only the headset input, (2) a rule-based edit that selects cameras at a fixed pacing using pose data, and (3) an editing algorithm using LSTM (Long-short term memory) learned joint-attention from both pose and audio data, trained on expert edits. The methods are evaluated qualitatively against the human edit, and quantitatively in a user study with 22 participants. Results indicate that LSTM-trained joint attention produces edits that are comparable to the expert edit, offering a wider range of camera views than audio, while being more generalizable as compared to rule-based methods.
引用
收藏
页码:55 / 64
页数:10
相关论文
共 50 条
  • [31] ENRICHED VIDEO EDITING
    OLIVER, D
    COMPUTER GRAPHICS WORLD, 1994, 17 (12) : 20 - &
  • [32] VIDEO EDITING TECHNIQUE
    不详
    BELL LABORATORIES RECORD, 1968, 46 (11): : 385 - &
  • [33] Seamless video editing
    Wang, HC
    Raskar, R
    Ahuja, N
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004, : 858 - 861
  • [34] Editing digital video
    不详
    TRAINING & DEVELOPMENT, 2001, 55 (01): : 71 - 71
  • [35] IS JOINT ATTENTION DETECTABLE at a DISTANCE? THREE AUTOMATED, INTERNET-BASED TESTS
    Sheldrake, Rupert
    Beeharee, Ashweeni
    EXPLORE-THE JOURNAL OF SCIENCE AND HEALING, 2016, 12 (01) : 34 - 41
  • [36] An efficient framework for quantum video and video editing
    Wei, Zhanhong
    Sun, Wentao
    Zhu, Shangchao
    Han, Mengdi
    Yin, Huijuan
    INTERNATIONAL JOURNAL OF QUANTUM INFORMATION, 2023, 21 (05)
  • [37] Look At That! Video Chat and Joint Visual Attention Development Among Babies and Toddlers
    McClure, Elisabeth R.
    Chentsova-Dutton, Yulia E.
    Holochwost, Steven J.
    Parrott, W. G.
    Barr, Rachel
    CHILD DEVELOPMENT, 2018, 89 (01) : 27 - 36
  • [38] The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing
    Argaw, Dawit Mureja
    Heilbron, Fabian Caba
    Lee, Joon-Young
    Woodson, Markus
    Kweon, In So
    COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 201 - 218
  • [39] Interactive Intrinsic Video Editing
    Bonneel, Nicolas
    Sunkavalli, Kalyan
    Tompkin, James
    Sun, Deqing
    Paris, Sylvain
    Pfister, Hanspeter
    ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (06):
  • [40] FILM EDITING THE VIDEO WAY
    LANG, S
    LANG, S
    INDUSTRIAL PHOTOGRAPHY, 1984, 33 (09): : 34 - 35