Joint Attention for Automated Video Editing

被引:2
|
作者
Wu, Hui-Yin [1 ]
Santarra, Trevor [2 ]
Leece, Michael [3 ]
Vargas, Rolando [3 ]
Jhala, Arnav [4 ]
机构
[1] Univ Cote dAzur, INRIA, Sophia Antipolis, France
[2] Unity Technol, San Francisco, CA USA
[3] Univ Calif Santa Cruz, Santa Cruz, CA USA
[4] North Carolina State Univ, Raleigh, NC USA
关键词
smart conferencing; automated video editing; joint attention; LSTM;
D O I
10.1145/3391614.3393656
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Joint attention refers to the shared focal points of attention for occupants in a space. In this work, we introduce a computational definition of joint attention for the automated editing of meetings in multi-camera environments from the AMI corpus. Using extracted head pose and individual headset amplitude as features, we developed three editing methods: (1) a naive audio-based method that selects the camera using only the headset input, (2) a rule-based edit that selects cameras at a fixed pacing using pose data, and (3) an editing algorithm using LSTM (Long-short term memory) learned joint-attention from both pose and audio data, trained on expert edits. The methods are evaluated qualitatively against the human edit, and quantitatively in a user study with 22 participants. Results indicate that LSTM-trained joint attention produces edits that are comparable to the expert edit, offering a wider range of camera views than audio, while being more generalizable as compared to rule-based methods.
引用
收藏
页码:55 / 64
页数:10
相关论文
共 50 条
  • [1] Automated video tape editing system
    SHIMADA R
    AKATSUKA S
    Toshiba Review, 1971, (62): : 5 - 10
  • [2] The application of video semantics and theme representation in automated video editing
    Nack, F
    Parkes, A
    MULTIMEDIA TOOLS AND APPLICATIONS, 1997, 4 (01) : 57 - 83
  • [3] The Application of Video Semantics and Theme Representation in Automated Video Editing
    Frank Nack
    Alan Parkes
    Multimedia Tools and Applications, 1997, 4 (1) : 57 - 83
  • [4] Automated Multi-Modal Video Editing for Ads Video
    Lin, Qin
    Pang, Nuo
    Hong, Zhiying
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4823 - 4827
  • [5] Automated Video Editing for Aesthetic Quality Improvement
    Choi, Jun-Ho
    Lee, Jong-Seok
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1003 - 1006
  • [6] AN AUTOMATED VIDEO-TAPE EDITING SYSTEM
    CAMPBELL, KD
    JOURNAL OF THE SOCIETY OF MOTION PICTURE TELEVISION ENGINEERS, 1970, 79 (03): : 191 - &
  • [7] Generative Methods for Automated Music Video Editing
    Stefan, Julia
    ENTERTAINMENT COMPUTING - ICEC 2014, 2014, 8770 : 226 - 228
  • [8] Editing like Humans: A Contextual, Multimodal Framework for Automated Video Editing
    Koorathota, Sharath
    Adelman, Patrick
    Cotton, Kelly
    Sajda, Paul
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1701 - 1709
  • [9] Video quality analysis for an automated video capturing and editing system for conversation scenes
    Nishizaki, T
    Ogata, R
    Kameda, Y
    Ohta, Y
    Nakamura, Y
    2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 678 - 681
  • [10] Speaker Identification for the Analysis of Joint Attention in Video
    Gonzalez Contreras, Carlos Eduardo
    De-la-Torre, Miguel
    Gonzalez Becerra, Victor Hugo
    Avila-George, Himer
    Hernandez Palacio, Raul
    2019 8TH INTERNATIONAL CONFERENCE ON SOFTWARE PROCESS IMPROVEMENT (CIMPS), 2019,