Beyond audio and video retrieval: topic-oriented multimedia summarization

被引:13
|
作者
Metze, Florian [1 ]
Ding, Duo [1 ]
Younessian, Ehsan [1 ]
Hauptmann, Alexander [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
Multimedia summarization; Event detection and recounting; Natural language generation;
D O I
10.1007/s13735-012-0028-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given the deluge of multimedia content that is becoming available over the Internet, it is increasingly important to be able to effectively examine and organize these large stores of information inways that go beyond browsing or collaborative filtering. In this paper, we review previous work on audio and video processing, and define the task of topic-oriented multimedia summarization (TOMS) using natural language generation (NLG): given a set of automatically extracted features from a video, a TOMS system will automatically generate a paragraph of natural language, which summarizes the important information in a video belonging to a certain topic, and for example provides explanations for why a video was matched and retrieved. Possible features include visual semantic concepts, objects, and actions, environmental sounds, and transcripts from automatic speech recognition (ASR). We see this as a first step towards systems that will be able to discriminate visually similar, but semantically different videos, compare two videos and provide textual output or summarize a large number of videos at once. In this paper, we introduce our approach of solving the TOMS problem. We extract various visual concept features, environmental sounds and ASR transcription features from a given video, and develop a template-based NLG system to produce a textual recounting based on the extracted features. We also propose possible experimental designs for continuously evaluating and improving TOMS systems, and present results of a pilot evaluation of our initial system.
引用
收藏
页码:131 / 144
页数:14
相关论文
共 50 条
  • [1] SPARSE MODELING FOR TOPIC-ORIENTED VIDEO SUMMARIZATION
    Panda, Rameswar
    Roy-Chowdhury, Amit K.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1388 - 1392
  • [2] Topic-Oriented Dialogue Summarization
    Lin, Haitao
    Zhu, Junnan
    Xiang, Lu
    Zhai, Feifei
    Zhou, Yu
    Zhang, Jiajun
    Zong, Chengqing
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1797 - 1810
  • [3] Topic-Oriented Spoken Dialogue Summarization for Customer Service with Saliency-Aware Topic Modeling
    Zou, Yicheng
    Zhao, Lujun
    Kang, Yangyang
    Lin, Jun
    Peng, Minlong
    Jiang, Zhuoren
    Sun, Changlong
    Zhang, Qi
    Huang, Xuanjing
    Liu, Xiaozhong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14665 - 14673
  • [4] SCHC: Incorporating Social Contagion and Hashtag Consistency for Topic-Oriented Social Summarization
    He, Ruifang
    Liu, Huanyu
    Zhao, Liangliang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT II, 2021, 12682 : 641 - 657
  • [5] Topic-oriented mining and reasoning
    Li, YF
    Zhong, N
    Yao, YY
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON ACTIVE MEDIA TECHNOLOGY (AMT 2005), 2005, : 321 - 326
  • [6] Automatic summarization of MEDLINE citations for evidence-based medical treatment: A topic-oriented evaluation
    Fiszman, Marcelo
    Demner-Fushman, Dina
    Kilicoglu, Halil
    Rindflesch, Thomas C.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2009, 42 (05) : 801 - 813
  • [7] Topic-oriented measurement of microblogging network
    Liu, Wei
    Wang, Li-Hong
    Li, Rui-Guang
    Tongxin Xuebao/Journal on Communications, 2013, 34 (11): : 171 - 178
  • [8] Topic-Oriented Text Features Can Match Visual Deep Models of Video Memorability
    Kleinlein, Ricardo
    Luna-Jimenez, Cristina
    Arias-Cuadrado, David
    Ferreiros, Javier
    Fernandez-Martinez, Fernando
    APPLIED SCIENCES-BASEL, 2021, 11 (16):
  • [9] Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and Context-Aware Auto-Encoders
    Zou, Yicheng
    Lin, Jun
    Zhao, Lujun
    Kang, Yangyang
    Jiang, Zhuoren
    Sun, Changlong
    Zhang, Qi
    Huang, Xuanjing
    Liu, Xiaozhong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14674 - 14682
  • [10] A topic-oriented clustering approach for domain services
    Wang, J. (jianwang@whu.edu.cn), 1600, Science Press (51):