Can Language Models Laugh at YouTube Short-form Videos?

被引:0
|
作者
Ko, Dayoon [1 ]
Lee, Sangho [2 ]
Kim, Gunhee [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Allen Inst Artificial Intelligence, Seattle, WA USA
基金
新加坡国家研究基金会;
关键词
RESOLUTION; HUMOR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As short-form funny videos on social networks are gaining popularity, it becomes demanding for AI models to understand them for better communication with humans. Unfortunately, previous video humor datasets target specific domains such as speeches or sitcoms, and mostly focus on verbal cues. We curate a user-generated dataset of 10K multimodal funny videos from YouTube, called ExFunTube. Using a video filtering pipeline with GPT-3.5, we verify both verbal and visual elements contributing to humor. After filtering, we annotate each video with timestamps and text explanations for funny moments. Our ExFunTube is unique over existing datasets in that our videos cover a wide range of domains with various types of humor that necessitate a multimodal understanding of the content. Also, we develop a zero-shot video-to-text prompting to maximize video humor understanding of large language models (LLMs). With three different evaluation methods using automatic scores, rationale quality experiments, and human evaluations, we show that our prompting significantly improves LLMs' ability for humor explanation.
引用
收藏
页码:2897 / 2916
页数:20
相关论文
共 50 条
  • [1] Short-Form Videos for Colorectal Cancer Screening Awareness
    Restrepo, Nicolas
    Escobar, Betsy
    Suarez, Milena G.
    Montealegre, Jane
    Jibaja-Weiss, Maria
    AMERICAN JOURNAL OF GASTROENTEROLOGY, 2024, 119 (10S): : S377 - S378
  • [2] Understanding the Effects of Short-Form Videos on Sustained Attention
    Lin, Bei-Hong
    Chung, Yu-Jung
    Cheng, Hao-Yuan
    Yen, Yu-Ting
    Li, Ching-Chuan
    Cherng, Fu-Yin
    EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
  • [3] Generating Hashtags for Short-form Videos with Guided Signals
    Yu, Tiezheng
    Yu, Hanchao
    Liang, Davis
    Mao, Yuning
    Nie, Shaoliang
    Huang, Po-Yao
    Khabsa, Madian
    Fung, Pascale
    Wang, Yi-Chia
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 9482 - 9495
  • [4] Making Short-Form Videos Accessible with Hierarchical Video Summaries
    Van Daele, Tess
    Iyer, Akhil
    Zhang, Yuning
    Derry, Jalyn C.
    Huh, Mina
    Pavel, Amy
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS (CHI 2024), 2024,
  • [5] Can Videos on TikTok Improve Pap Smear Attitudes and Intentions? Effects of Source and Autonomy Support in Short-Form Health Videos
    Kirkpatrick, Ciera E.
    Lawrie, LaRissa L.
    HEALTH COMMUNICATION, 2024, 39 (10) : 2066 - 2078
  • [6] Short-Form Videos as an Emerging Social Media Tool for STEM Edutainment
    Prindle, Claudia R.
    Orchanian, Nicholas M.
    Venkataraman, Latha
    Nuckolls, Colin
    JOURNAL OF CHEMICAL EDUCATION, 2024, 101 (03) : 1319 - 1324
  • [7] Investigating Consumer Attitudes Toward Recessive Advertising in Short-Form Videos
    Fu, Xiaoliang
    Liu, Lili
    Liang, Siyi
    Ling, Zipei
    Qian, Xiarui
    Mao, Zhewei
    HCI IN BUSINESS, GOVERNMENT AND ORGANIZATIONS, PT I, HCIBGO 2024, 2024, 14720 : 23 - 32
  • [8] Using short-form student videos to widen the canon of political thought
    Karlsson, Rasmus
    Eriksson, Kalle
    LEARNING AND TEACHING-THE INTERNATIONAL JOURNAL OF HIGHER EDUCATION IN THE SOCIAL SCIENCES, 2022, 15 (01) : 92 - 100
  • [9] Short-Form Videos for Public Library Marketing: Performance Analytics of Douyin in China
    Liu, Ying
    Chiu, Dickson K. W.
    Ho, Kevin K. W.
    APPLIED SCIENCES-BASEL, 2023, 13 (06):
  • [10] Design guidelines for augmenting short-form videos using animated data visualizations
    Tang, Tan
    Tang, Junxiu
    Hong, Jiayi
    Yu, Lingyun
    Ren, Peiran
    Wu, Yingcai
    JOURNAL OF VISUALIZATION, 2020, 23 (04) : 707 - 720