50 entries in total
- [32] Knowledge enhancement and scene understanding for knowledge-based visual question answering. Knowledge and Information Systems, 2024, 66: 2193-2208
- [34] OECA-Net: A co-attention network for visual question answering based on OCR scene text feature enhancement. Multimedia Tools and Applications, 2024, 83: 7085-7096
- [35] Question Modifiers in Visual Question Answering. LREC 2022: Thirteenth International Conference on Language Resources and Evaluation, 2022: 1472-1479
- [38] Answer-Based Entity Extraction and Alignment for Visual Text Question Answering. Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, 2023: 9487-9491
- [39] VTQAGen: BART-based Generative Model For Visual Text Question Answering. Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, 2023: 9456-9461
- [40] Visual question answering based evaluation metrics for text-to-image generation. 2024 IEEE International Symposium on Circuits and Systems, ISCAS 2024, 2024