50 items in total
- [41] Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation COMPUTER VISION - ECCV 2024, PT XVII, 2025, 15075 : 388 - 404
- [47] LimSim++: A Closed-Loop Platform for Deploying Multimodal LLMs in Autonomous Driving 2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024 : 1084 - 1090
- [48] GENIXER: Empowering Multimodal Large Language Model as a Powerful Data Generator COMPUTER VISION - ECCV 2024, PT XXIII, 2025, 15081 : 129 - 147
- [50] OmniActions: Predicting Digital Actions in Response to Real-World Multimodal Sensory Inputs with LLMs PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024