Joint Attention for Automated Video Editing

Cited by: 2

Authors:

Wu, Hui-Yin [1]

Santarra, Trevor [2]

Leece, Michael [3]

Vargas, Rolando [3]

Jhala, Arnav [4]

Affiliations:

[1] Univ Cote dAzur, INRIA, Sophia Antipolis, France

[2] Unity Technol, San Francisco, CA USA

[3] Univ Calif Santa Cruz, Santa Cruz, CA USA

[4] North Carolina State Univ, Raleigh, NC USA

Source:

PROCEEDINGS OF THE 2020 ACM INTERNATIONAL CONFERENCE ON INTERACTIVE MEDIA EXPERIENCES, IMX 2020 | 2020

Keywords:

smart conferencing; automated video editing; joint attention; LSTM

DOI:

10.1145/3391614.3393656

Chinese Library Classification (CLC):

TP39 [Computer applications]

Discipline classification codes:

081203; 0835

Abstract:

Joint attention refers to the shared focal points of attention for occupants in a space. In this work, we introduce a computational definition of joint attention for the automated editing of meetings in multi-camera environments from the AMI corpus. Using extracted head pose and individual headset amplitude as features, we developed three editing methods: (1) a naive audio-based method that selects the camera using only the headset input, (2) a rule-based edit that selects cameras at a fixed pacing using pose data, and (3) an editing algorithm that uses joint attention learned by an LSTM (long short-term memory) network from both pose and audio data, trained on expert edits. The methods are evaluated qualitatively against the human edit and quantitatively in a user study with 22 participants. Results indicate that LSTM-trained joint attention produces edits that are comparable to the expert edit, offering a wider range of camera views than the audio-based method while being more generalizable than the rule-based method.
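
As an illustration of method (1) only, the following minimal sketch picks, for each time window, the camera associated with the loudest headset. The abstract does not give implementation details, so the function name `select_cameras_by_audio`, the non-overlapping windowing, the mean absolute amplitude measure, and the one-camera-per-headset mapping are all assumptions made for this example, not the authors' method.

```python
from typing import List, Sequence


def select_cameras_by_audio(
    amplitudes: Sequence[Sequence[float]], window: int
) -> List[int]:
    """Return, per non-overlapping window, the index of the loudest headset.

    Assumption: headset i maps to camera i, and loudness is the mean
    absolute amplitude inside the window. Ties go to the lowest index.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    if not amplitudes:
        raise ValueError("at least one headset channel is required")

    # Only use the samples that every channel has in common.
    n_samples = min(len(channel) for channel in amplitudes)

    choices: List[int] = []
    for start in range(0, n_samples, window):
        end = min(start + window, n_samples)
        # Mean absolute amplitude of each headset in this window.
        loudness = [
            sum(abs(x) for x in channel[start:end]) / (end - start)
            for channel in amplitudes
        ]
        # Pick the headset (and therefore the camera) that is loudest.
        choices.append(max(range(len(loudness)), key=loudness.__getitem__))
    return choices
```

For example, with two headsets where the first is louder in the first window and the second is louder in the next, `select_cameras_by_audio([[0.9, 0.8, 0.1, 0.0], [0.1, 0.2, 0.7, 0.6]], window=2)` selects camera 0 and then camera 1. A selection driven only by audio can never show a silent participant, which is consistent with the abstract's remark that the learned method offers a wider range of camera views.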


Pages: 55-64

Page count: 10
