Fine -Grained Distillation for Long Document Retrieval

被引:0
|
作者
Zhou, Yucheng [1 ,4 ]
Shen, Tao [2 ]
Geng, Xiubo [3 ]
Tao, Chongyang [3 ]
Shen, Jianbing [1 ]
Long, Guodong [2 ]
Xu, Can [3 ]
Jiang, Daxin [3 ]
机构
[1] Univ Macau, CIS, SKL IOTSC, Taipa, Macau, Peoples R China
[2] Univ Technol Sydney, AAII, FEIT, Sydney, NSW, Australia
[3] Microsoft Corp, Redmond, WA 98052 USA
[4] Microsoft, Redmond, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Long document retrieval aims to fetch query -relevant documents from a large-scale collection, where knowledge distillation has become de facto to improve a retriever by mimicking a heterogeneous yet powerful cross -encoder. However, in contrast to passages or sentences, retrieval on long documents suffers from the scope hypothesis that a long document may cover multiple topics. This maximizes their structure heterogeneity and poses a granular-mismatch issue, leading to an inferior distillation efficacy. In this work, we propose a new learning framework, fine-grained distillation (FGD), for long -document retrievers. While preserving the conventional dense retrieval paradigm, it first produces global -consistent representations crossing different fine granularity and then applies multi-granular aligned distillation merely during training. In experiments, we evaluate our framework on two long document retrieval benchmarks, which show state-of-the-art performance.
引用
收藏
页码:19732 / 19740
页数:9
相关论文
共 50 条
  • [31] Fine-Grained Prototypes Distillation for Few-Shot Object Detection
    Wang, Zichen
    Yang, Bo
    Yue, Haonan
    Ma, Zhenghao
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5859 - 5866
  • [32] Identifying apple leaf disease using a fine-grained distillation model
    Li D.
    Hua C.
    Liu Y.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2023, 39 (07): : 185 - 194
  • [33] Filtration and Distillation: Enhancing Region Attention for Fine-Grained Visual Categorization
    Liu, Chuanbin
    Xie, Hongtao
    Zha, Zheng-Jun
    Ma, Lingfeng
    Yu, Lingyun
    Zhang, Yongdong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11555 - 11562
  • [34] Loop and distillation: Attention weights fusion transformer for fine-grained representation
    Fayou, Sun
    Ngo, Hea Choon
    Meng, Zuqiang
    Sek, Yong Wee
    IET COMPUTER VISION, 2023, 17 (04) : 473 - 482
  • [35] Data-free Knowledge Distillation for Fine-grained Visual Categorization
    Shao, Renrong
    Zhang, Wei
    Yin, Jianhua
    Wang, Jun
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1515 - 1525
  • [36] Localized Triplet Loss for Fine-grained Fashion Image Retrieval
    D'Innocente, Antonio
    Garg, Nikhil
    Zhang, Yuan
    Bazzani, Loris
    Donoser, Michael
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3905 - 3910
  • [37] CEKD:Cross ensemble knowledge distillation for augmented fine-grained data
    Ke Zhang
    Jin Fan
    Shaoli Huang
    Yongliang Qiao
    Xiaofeng Yu
    Feiwei Qin
    Applied Intelligence, 2022, 52 : 16640 - 16650
  • [38] Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation
    Hao, Zhiwei
    Guo, Jianyuan
    Jia, Ding
    Han, Kai
    Tang, Yehui
    Zhang, Chao
    Hu, Han
    Wang, Yunhe
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [39] Fine-Grained Color Sketch-Based Image Retrieval
    Xia, Yu
    Wang, Shuangbu
    Li, Yanran
    You, Lihua
    Yang, Xiaosong
    Zhang, Jian Jun
    ADVANCES IN COMPUTER GRAPHICS, CGI 2019, 2019, 11542 : 424 - 430
  • [40] Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval
    Lu, Xin
    Chen, Shikun
    Cao, Yichao
    Zhou, Xin
    Lu, Xiaobo
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6558 - 6566