Joint Attention for Automated Video Editing

被引：2

作者：

Wu, Hui-Yin ^{[1
]}

Santarra, Trevor ^{[2
]}

Leece, Michael ^{[3
]}

Vargas, Rolando ^{[3
]}

Jhala, Arnav ^{[4
]}

机构：

[1] Univ Cote dAzur, INRIA, Sophia Antipolis, France

[2] Unity Technol, San Francisco, CA USA

[3] Univ Calif Santa Cruz, Santa Cruz, CA USA

[4] North Carolina State Univ, Raleigh, NC USA

来源：

PROCEEDINGS OF THE 2020 ACM INTERNATIONAL CONFERENCE ON INTERACTIVE MEDIA EXPERIENCES, IMX 2020 | 2020年

关键词：

smart conferencing; automated video editing; joint attention; LSTM;

D O I：

10.1145/3391614.3393656

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Joint attention refers to the shared focal points of attention for occupants in a space. In this work, we introduce a computational definition of joint attention for the automated editing of meetings in multi-camera environments from the AMI corpus. Using extracted head pose and individual headset amplitude as features, we developed three editing methods: (1) a naive audio-based method that selects the camera using only the headset input, (2) a rule-based edit that selects cameras at a fixed pacing using pose data, and (3) an editing algorithm using LSTM (Long-short term memory) learned joint-attention from both pose and audio data, trained on expert edits. The methods are evaluated qualitatively against the human edit, and quantitatively in a user study with 22 participants. Results indicate that LSTM-trained joint attention produces edits that are comparable to the expert edit, offering a wider range of camera views than audio, while being more generalizable as compared to rule-based methods.

引用

页码：55 / 64

页数：10

共 50 条

[31] ENRICHED VIDEO EDITING
OLIVER, D
COMPUTER GRAPHICS WORLD, 1994, 17 (12) : 20 - &
[32] VIDEO EDITING TECHNIQUE
不详
BELL LABORATORIES RECORD, 1968, 46 (11): : 385 - &
[33] Seamless video editing
Wang, HC
Raskar, R
Ahuja, N
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004, : 858 - 861
[34] Editing digital video
不详
TRAINING & DEVELOPMENT, 2001, 55 (01): : 71 - 71
[35] IS JOINT ATTENTION DETECTABLE at a DISTANCE? THREE AUTOMATED, INTERNET-BASED TESTS
Sheldrake, Rupert
Beeharee, Ashweeni
EXPLORE-THE JOURNAL OF SCIENCE AND HEALING, 2016, 12 (01) : 34 - 41
[36] An efficient framework for quantum video and video editing
Wei, Zhanhong
Sun, Wentao
Zhu, Shangchao
Han, Mengdi
Yin, Huijuan
INTERNATIONAL JOURNAL OF QUANTUM INFORMATION, 2023, 21 (05)
[37] Look At That! Video Chat and Joint Visual Attention Development Among Babies and Toddlers
McClure, Elisabeth R.
Chentsova-Dutton, Yulia E.
Holochwost, Steven J.
Parrott, W. G.
Barr, Rachel
CHILD DEVELOPMENT, 2018, 89 (01) : 27 - 36
[38] The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing
Argaw, Dawit Mureja
Heilbron, Fabian Caba
Lee, Joon-Young
Woodson, Markus
Kweon, In So
COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 201 - 218
[39] Interactive Intrinsic Video Editing
Bonneel, Nicolas
Sunkavalli, Kalyan
Tompkin, James
Sun, Deqing
Paris, Sylvain
Pfister, Hanspeter
ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (06):
[40] FILM EDITING THE VIDEO WAY
LANG, S
LANG, S
INDUSTRIAL PHOTOGRAPHY, 1984, 33 (09): : 34 - 35

← 1 2 3 4 5 →