Learning Physically Simulated Tennis Skills from Broadcast Videos

Cited by: 18
Authors
Zhang, Haotian [1]
Yuan, Ye [2]
Makoviychuk, Viktor [2]
Guo, Yunrong [3]
Fidler, Sanja [3, 4, 5]
Peng, Xue Bin [3, 6]
Fatahalian, Kayvon [1]
Affiliations
[1] Stanford Univ, Stanford, CA 94305 USA
[2] NVIDIA, Santa Clara, CA USA
[3] NVIDIA, Toronto, ON, Canada
[4] Univ Toronto, Toronto, ON, Canada
[5] Vector Inst, Toronto, ON, Canada
[6] Simon Fraser Univ, Burnaby, BC, Canada
Source
ACM TRANSACTIONS ON GRAPHICS | 2023 / Volume 42 / Issue 04
Keywords
physics-based character animation; imitation learning; reinforcement learning
DOI
10.1145/3592408
Chinese Library Classification
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
We present a system that learns diverse, physically simulated tennis skills from large-scale demonstrations of tennis play harvested from broadcast videos. Our approach is built upon hierarchical models, combining a low-level imitation policy and a high-level motion planning policy to steer the character in a motion embedding learned from broadcast videos. When deployed at scale on large video collections that encompass a vast set of examples of real-world tennis play, our approach can learn complex tennis shotmaking skills and realistically chain together multiple shots into extended rallies, using only simple rewards and without explicit annotations of stroke types. To address the low quality of motions extracted from broadcast videos, we correct estimated motion with physics-based imitation, and use a hybrid control policy that overrides erroneous aspects of the learned motion embedding with corrections predicted by the high-level policy. We demonstrate that our system produces controllers for physically simulated tennis players that can hit the incoming ball to target positions accurately using a diverse array of strokes (serves, forehands, and backhands), spins (topspins and slices), and playing styles (one/two-handed backhands, left/right-handed play). Overall, our system can synthesize two physically simulated characters playing extended tennis rallies with simulated racket and ball dynamics. Code and data for this work are available at https://research.nvidia.com/labs/toronto-ai/vid2player3d/.
Pages: 14