Fine -Grained Distillation for Long Document Retrieval

被引:0
|
作者
Zhou, Yucheng [1 ,4 ]
Shen, Tao [2 ]
Geng, Xiubo [3 ]
Tao, Chongyang [3 ]
Shen, Jianbing [1 ]
Long, Guodong [2 ]
Xu, Can [3 ]
Jiang, Daxin [3 ]
机构
[1] Univ Macau, CIS, SKL IOTSC, Taipa, Macau, Peoples R China
[2] Univ Technol Sydney, AAII, FEIT, Sydney, NSW, Australia
[3] Microsoft Corp, Redmond, WA 98052 USA
[4] Microsoft, Redmond, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Long document retrieval aims to fetch query -relevant documents from a large-scale collection, where knowledge distillation has become de facto to improve a retriever by mimicking a heterogeneous yet powerful cross -encoder. However, in contrast to passages or sentences, retrieval on long documents suffers from the scope hypothesis that a long document may cover multiple topics. This maximizes their structure heterogeneity and poses a granular-mismatch issue, leading to an inferior distillation efficacy. In this work, we propose a new learning framework, fine-grained distillation (FGD), for long -document retrievers. While preserving the conventional dense retrieval paradigm, it first produces global -consistent representations crossing different fine granularity and then applies multi-granular aligned distillation merely during training. In experiments, we evaluate our framework on two long document retrieval benchmarks, which show state-of-the-art performance.
引用
收藏
页码:19732 / 19740
页数:9
相关论文
共 50 条
  • [41] Adaptive Fine-Grained Sketch-Based Image Retrieval
    Bhunia, Ayan Kumar
    Sain, Aneeshan
    Shah, Parth Hiren
    Gupta, Animesh
    Chowdhury, Pinaki Nath
    Xiang, Tao
    Song, Yi-Zhe
    COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 163 - 181
  • [42] Hard Decorrelated Centralized Loss for fine-grained image retrieval
    Zeng, Xianxian
    Liu, Shun
    Wang, Xiaodong
    Zhang, Yun
    Chen, Kairui
    Li, Dong
    NEUROCOMPUTING, 2021, 453 : 26 - 37
  • [43] Towards Fine-grained Adaptation of Exploration/Exploitation in Information Retrieval
    Medlar, Alan
    Pyykko, Joel
    Glowacka, Dorota
    IUI'17: PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, 2017, : 623 - 627
  • [44] Topic-Grained Text Representation-Based Model for Document Retrieval
    Du, Mengxue
    Li, Shasha
    Jie, Yu
    Ma, Jun
    Bin, Ji
    Liu, Huijun
    Lin, Wuhang
    Yi, Zibo
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT III, 2022, 13531 : 776 - 788
  • [45] Generalising Fine-Grained Sketch-Based Image Retrieval
    Pang, Kaiyue
    Li, Ke
    Yang, Yongxin
    Zhang, Honggang
    Hospedales, Timothy M.
    Xiang, Tao
    Song, Yi-Zhe
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 677 - 686
  • [46] Discriminative feature mining hashing for fine-grained image retrieval
    Lang, Wenxi
    Sun, Han
    Xu, Can
    Liu, Ningzhong
    Zhou, Huiyu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [47] Style Finder: Fine-Grained Clothing Style Recognition and Retrieval
    Di, Wei
    Wah, Catherine
    Bhardwaj, Anurag
    Piramuthu, Robinson
    Sundaresan, Neel
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, : 8 - 13
  • [48] Selective Convolutional Descriptor Aggregation for Fine-Grained Image Retrieval
    Wei, Xiu-Shen
    Luo, Jian-Hao
    Wu, Jianxin
    Zhou, Zhi-Hua
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (06) : 2868 - 2881
  • [49] Deep Listwise Triplet Hashing for Fine-Grained Image Retrieval
    Liang, Yuchen
    Pan, Yan
    Lai, Hanjiang
    Liu, Wei
    Yin, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 949 - 961
  • [50] Fine-grained Classification of Identity Document Types with Only One Example
    Simon, Marcel
    Rodner, Erik
    Denzler, Joachim
    2015 14TH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2015, : 126 - 129