Can Language Models Laugh at YouTube Short-form Videos?

被引:0
|
作者
Ko, Dayoon [1 ]
Lee, Sangho [2 ]
Kim, Gunhee [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Allen Inst Artificial Intelligence, Seattle, WA USA
基金
新加坡国家研究基金会;
关键词
RESOLUTION; HUMOR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As short-form funny videos on social networks are gaining popularity, it becomes demanding for AI models to understand them for better communication with humans. Unfortunately, previous video humor datasets target specific domains such as speeches or sitcoms, and mostly focus on verbal cues. We curate a user-generated dataset of 10K multimodal funny videos from YouTube, called ExFunTube. Using a video filtering pipeline with GPT-3.5, we verify both verbal and visual elements contributing to humor. After filtering, we annotate each video with timestamps and text explanations for funny moments. Our ExFunTube is unique over existing datasets in that our videos cover a wide range of domains with various types of humor that necessitate a multimodal understanding of the content. Also, we develop a zero-shot video-to-text prompting to maximize video humor understanding of large language models (LLMs). With three different evaluation methods using automatic scores, rationale quality experiments, and human evaluations, we show that our prompting significantly improves LLMs' ability for humor explanation.
引用
收藏
页码:2897 / 2916
页数:20
相关论文
共 50 条
  • [21] Driving Factors and Moderating Effects Behind Citizen Engagement With Mobile Short-Form Videos
    Zhang, Cevin
    Zheng, Hemingxi
    Wang, Qing
    IEEE ACCESS, 2022, 10 : 40999 - 41009
  • [22] VisTellAR: Embedding Data Visualization to Short-Form Videos Using Mobile Augmented Reality
    Tong, Wai
    Shigyo, Kento
    Yuan, Lin-Ping
    Fan, Mingming
    Pong, Ting-Chuen
    Qu, Huamin
    Xia, Meng
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2025, 31 (03) : 1862 - 1874
  • [23] The Relationship Between Parental Phubbing and Short-Form Videos Addiction Among Chinese Adolescents
    Wang, Hongxia
    Lei, Li
    JOURNAL OF RESEARCH ON ADOLESCENCE, 2022, 32 (04) : 1580 - 1591
  • [24] Recognizing irrelevant faces in short-form videos based on feature fusion and active learning
    Zhu, Mingcheng
    Zhang, Rongchuan
    Wang, Haizhou
    Neurocomputing, 2022, 501 : 694 - 704
  • [25] Dense Models from Videos: Can YouTube be the Font of All Knowledge Bases?
    Witbrock, Michael
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 1 - 1
  • [26] A SHORT-FORM MEASURE OF LONELINESS
    HAYS, RD
    DIMATTEO, MR
    JOURNAL OF PERSONALITY ASSESSMENT, 1987, 51 (01) : 69 - 81
  • [27] On the sins of short-form development
    Smith, GT
    McCarthy, DM
    Anderson, KG
    PSYCHOLOGICAL ASSESSMENT, 2000, 12 (01) : 102 - 111
  • [28] Ambiguities in the Short-Form Report
    Lee, Earle Goodrich
    JOURNAL OF ACCOUNTANCY, 1947, 84 (03): : 245 - 245
  • [29] Evolution to short-form theatre
    Devos, BW
    BULLETIN OF HISPANIC STUDIES, 2005, 82 (01): : 106 - 107
  • [30] MT-VQA: A Multi-task Approach for Quality Assessment of Short-form Videos
    Wen, Shijie
    Qiao, Minglang
    Jiang, Lai
    Xu, Mai
    Deng, Xin
    Li, Shengxi
    PROCEEDINGS OF THE 3RD WORKSHOP ON QUALITY OF EXPERIENCE IN VISUAL MULTIMEDIA APPLICATIONS, QOEVMA 2024, 2024, : 30 - 38