RL Based Unsupervised Video Summarization Framework for Ultrasound Imaging

被引:0
|
作者
Mathews, Roshan P. [1 ]
Panicker, Mahesh Raveendranatha [1 ]
Hareendranathan, Abhilash R. [2 ]
Chen, Yale Tung [3 ]
Jaremko, Jacob L. [2 ]
Buchanan, Brian [2 ]
Narayan, Kiran Vishnu [4 ]
Chandrasekharan, Kesavadas [5 ]
Mathews, Greeta [6 ]
机构
[1] Indian Inst Technol Palakkad, Palakkad, India
[2] Univ Alberta, Edmonton, AB, Canada
[3] Hosp Univ Puerta de Hierro Spain, Madrid, Spain
[4] Govt Med Coll Thiruvananthapuram, Thiruvananthapuram, Kerala, India
[5] Sree Chitra Inst Med Sci & Technol Thiruvananthap, Thiruvananthapuram, Kerala, India
[6] Bhagwan Mahaveer Jain Hosp Bangalore, Bangalore, Karnataka, India
来源
关键词
Ultrasound; Video summarization; Unsupervised reinforcement learning; Convolutional autoencoder; Transformer;
D O I
10.1007/978-3-031-16902-1_3
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The need for summarizing long medical scan videos for automatic triage in Emergency Departments and transmission of the summarized videos for telemedicine has gained significance during the COVID-19 pandemic. However, supervised learning schemes for summarizing videos are infeasible as manual labeling of scans for large datasets is impractical by frontline clinicians. This work presents a methodology to summarize ultrasound videos using completely unsupervised learning schemes and is validated on Lung Ultrasound videos. A Convolutional Autoencoder and a Transformer decoder is trained in an unsupervised reinforcement learning setup i.e., without supervised labels in the whole workflow. Novel precision and recall computation for ultrasound videos is also presented employing which high Precision and F 1 scores of 64.36% and 35.87% with an average video compression rate of 78% is obtained when validated against clinically annotated cases. Even though demonstrated using lung ultrasound videos, our approach can be readily extended to other imaging modalities.
引用
收藏
页码:23 / 33
页数:11
相关论文
共 50 条
  • [21] An extended framework for adaptive playback-based video summarization
    Peker, KA
    Divakaran, A
    INTERNET MULTIMEDIA MANAGEMENT SYSTEMS IV, 2003, 5242 : 26 - 33
  • [22] Deep Semantic and Attentive Network for Unsupervised Video Summarization
    Zhong, Sheng-Hua
    Lin, Jingxu
    Lu, Jianglin
    Fares, Ahmed
    Ren, Tongwei
    ACM Transactions on Multimedia Computing, Communications and Applications, 2022, 18 (02)
  • [23] Unsupervised video summarization via clustering validity index
    Zhao, Ye
    Guo, Yanrong
    Sun, Rui
    Liu, Zhengqiong
    Guo, Dan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 33417 - 33430
  • [24] Joint Reinforcement and Contrastive Learning for Unsupervised Video Summarization
    Zhang, Yunzuo
    Liu, Yameng
    Zhu, Pengfei
    Kang, Weili
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2587 - 2591
  • [25] Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization
    Pang, Zongshang
    Nakashima, Yuta
    Otani, Mayu
    Nagahara, Hajime
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2009 - 2018
  • [26] Unsupervised Video Summarization with Independently Recurrent Neural Networks
    Yaliniz, Gokhan
    Ikizler-Cinbis, Nazli
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [27] An Aesthetic-Driven Approach to Unsupervised Video Summarization
    Huang, Hongben
    Wu, Zaiqun
    Pang, Guangyao
    Xie, Jiehang
    IEEE ACCESS, 2024, 12 : 128768 - 128777
  • [28] Unsupervised learning of visual and semantic features for video summarization
    Huang, Yansen
    Zhong, Rui
    Yao, Wenjin
    Wang, Rui
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [29] SPARSE UNSUPERVISED CLUSTERING WITH MIXTURE OBSERVATIONS FOR VIDEO SUMMARIZATION
    Xiang, Xiang
    Tran, Dung N.
    Tran, Trac D.
    2017 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2017,
  • [30] Unsupervised video summarization via clustering validity index
    Ye Zhao
    Yanrong Guo
    Rui Sun
    Zhengqiong Liu
    Dan Guo
    Multimedia Tools and Applications, 2020, 79 : 33417 - 33430