50 items in total
- [41] CASA-Net: Cross-attention and Self-attention for End-to-End Audio-visual Speaker Diarization. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023: 102-106
- [42] Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2021, 2: 1495-1499
- [43] Self-Attention Channel Combinator Frontend for End-to-End Multichannel Far-field Speech Recognition. INTERSPEECH 2021, 2021: 3840-3844
- [45] Self-Distillation into Self-Attention Heads for Improving Transformer-based End-to-End Neural Speaker Diarization. INTERSPEECH 2023, 2023: 3197-3201
- [46] SWINBERT: End-to-End Transformers with Sparse Attention for Video Captioning. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022: 17928-17937
- [47] Attention Based End-to-End Network for Short Video Classification. 2022 18TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN, 2022: 490-494
- [48] End-to-End Multi-Task Learning with Attention. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 1871-1880
- [50] End-to-end frame-rate adaptive streaming of video data. IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 2, 1999: 67-71