Joint Attention for Automated Video Editing

Cited by: 2
Authors
Wu, Hui-Yin [1]
Santarra, Trevor [2]
Leece, Michael [3]
Vargas, Rolando [3]
Jhala, Arnav [4]
Affiliations
[1] Univ Côte d'Azur, INRIA, Sophia Antipolis, France
[2] Unity Technol, San Francisco, CA USA
[3] Univ Calif Santa Cruz, Santa Cruz, CA USA
[4] North Carolina State Univ, Raleigh, NC USA
Keywords
smart conferencing; automated video editing; joint attention; LSTM
DOI
10.1145/3391614.3393656
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Joint attention refers to the shared focal points of attention for occupants in a space. In this work, we introduce a computational definition of joint attention for the automated editing of meetings recorded in multi-camera environments, using the AMI corpus. Using extracted head pose and individual headset amplitude as features, we developed three editing methods: (1) a naive audio-based method that selects the camera using only the headset input, (2) a rule-based method that selects cameras at a fixed pacing using pose data, and (3) an editing algorithm that uses joint attention learned by a long short-term memory (LSTM) network from both pose and audio data, trained on expert edits. The methods are evaluated qualitatively against the human edit, and quantitatively in a user study with 22 participants. Results indicate that LSTM-trained joint attention produces edits comparable to the expert edit, offers a wider range of camera views than the audio-based method, and generalizes better than the rule-based method.
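As a rough illustration of method (1), a naive audio-based edit can be sketched as picking, for each frame, the camera of the participant whose headset is loudest. This is a minimal sketch and not the authors' implementation: the function name `select_cameras`, the one-camera-per-participant mapping, and the `min_hold` parameter (a minimum shot length that limits rapid cutting) are all assumptions introduced here for illustration.

```python
from typing import List, Sequence


def select_cameras(amplitudes: Sequence[Sequence[float]],
                   min_hold: int = 3) -> List[int]:
    """Return one camera index per frame.

    amplitudes[t][i] is the headset amplitude of participant i at frame t.
    Assumption: camera i is the close-up of participant i.
    Assumption: min_hold is the minimum number of frames a shot is kept
    before a cut to another camera is allowed.
    """
    cuts: List[int] = []
    current = -1  # no camera selected yet
    held = 0      # frames spent on the current camera
    for frame in amplitudes:
        # Index of the participant with the highest amplitude in this frame.
        loudest = max(range(len(frame)), key=lambda i: frame[i])
        # Cut on the first frame, or when the loudest speaker changed
        # and the current shot has been held long enough.
        if current == -1 or (loudest != current and held >= min_hold):
            current = loudest
            held = 0
        cuts.append(current)
        held += 1
    return cuts


# Two participants, five frames of hypothetical amplitude values.
demo = [[0.9, 0.1], [0.2, 0.8], [0.2, 0.8], [0.2, 0.8], [0.7, 0.1]]
print(select_cameras(demo, min_hold=2))
```

Because this selection depends only on who is speaking, it can only ever show the current speaker's camera, which is consistent with the abstract's observation that the audio-based method offers a narrower range of views than the LSTM-based edit.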
Pages: 55-64
Number of pages: 10