50 entries in total
- [32] Improving Visual Question Answering by Multimodal Gate Fusion Network. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023
- [33] Putting ChatGPT vision (GPT-4V) to the test: risk perception in traffic images. ROYAL SOCIETY OPEN SCIENCE, 2024, 11 (05)
- [34] Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data. 2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024: 7839-7849
- [35] Glaucoma Detection and Feature Identification via GPT-4V Fundus Image Analysis. OPHTHALMOLOGY SCIENCE, 2025, 5 (02)
- [37] Advancements in AI for Gastroenterology Education: An Assessment of OpenAI's GPT-4 and GPT-3.5 in MKSAP Question Interpretation. AMERICAN JOURNAL OF GASTROENTEROLOGY, 2024, 119 (10S): S1580
- [40] Multimodal Encoder-Decoder Attention Networks for Visual Question Answering. IEEE ACCESS, 2020, 8: 35662-35671