共 50 条
- [1] Towards Global Video Scene Segmentation with Context-Aware Transformer THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3206 - 3213
- [2] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18061 - 18070
- [5] Video Diffusion Models with Local-Global Context Guidance PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1640 - 1648
- [6] DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition COMPUTER VISION, ECCV 2022, PT XXXIV, 2022, 13694 : 577 - 595
- [7] SLViT: Scale-Wise Language-Guided Vision Transformer for Referring Image Segmentation PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1294 - 1302
- [8] CLIP-It! Language-Guided Video Summarization ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [9] Hybrid Local-Global Context Learning for Neural Video Compression 2024 DATA COMPRESSION CONFERENCE, DCC, 2024, : 322 - 331
- [10] mmFilter: Language-Guided Video Analytics at the Edge PROCEEDINGS OF THE 2020 21ST INTERNATIONAL MIDDLEWARE CONFERENCE INDUSTRIAL TRACK (MIDDLEWARE INDUSTRY '20), 2020, : 1 - 7