共 50 条
- [42] An Adaptive Multimodal Fusion Network Based on Multilinear Gradients for Visual Question Answering ELECTRONICS, 2025, 14 (01):
- [48] A focus fusion attention mechanism integrated with image captions for knowledge graph-based visual question answering Signal, Image and Video Processing, 2024, 18 : 3471 - 3482