A Learning to Rank framework applied to text-image retrieval

被引:0
|
作者
David Buffoni
Sabrina Tollari
Patrick Gallinari
机构
[1] Université Pierre et Marie CURIE - Paris 6 / LIP6,
来源
关键词
Learning to Rank; Text-image retrieval; OWPC; Visuo-textual fusion; Pooling for Learning to Rank;
D O I
暂无
中图分类号
学科分类号
摘要
We present a framework based on a Learning to Rank setting for a text-image retrieval task. In Information Retrieval, the goal is to compute the similarity between a document and an user query. In the context of text-image retrieval where several similarities exist, human intervention is often needed to decide on the way to combine them. On the other hand, with the Learning to Rank approach the combination of the similarities is done automatically. Learning to Rank is a paradigm where the learnt objective function is able to produce a ranked list of images when a user query is given. These score functions are generally a combination of similarities between a document and a query. In the past, Learning to Rank algorithms were successfully applied to text retrieval where they outperformed baselines such as BM25 or TFIDF. This inspired us to apply our state-of-the-art algorithm, called OWPC (Usunier et al. 2009), to the text-image retrieval task. At this time, no benchmarks are available, therefore we present a framework for building one. The empirical validation of this algorithm is done on the dataset constructed through comparison of typical text-image retrieval similarities. In both cases, visual only and text and visual, our algorithm performs better than a simple baseline.
引用
收藏
页码:161 / 180
页数:19
相关论文
共 50 条
  • [1] A Learning to Rank framework applied to text-image retrieval
    Buffoni, David
    Tollari, Sabrina
    Gallinari, Patrick
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (01) : 161 - 180
  • [2] Text-Image Retrieval With Salient Features
    Feng, Xia
    Hu, Zhiyi
    Liu, Caihua
    Ip, W. H.
    Chen, Huiying
    JOURNAL OF DATABASE MANAGEMENT, 2021, 32 (04) : 1 - 13
  • [3] The Importance of the Depth for Text-Image Selection Strategy in Learning-To-Rank
    Buffoni, David
    Tollari, Sabrina
    Gallinari, Patrick
    ADVANCES IN INFORMATION RETRIEVAL, 2011, 6611 : 743 - 746
  • [4] Experiences in evaluating multilingual and text-image information retrieval
    Garcia-Serrano, Ana M.
    Martinez-Fernandez, Jose L.
    Martinez, Paloma
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2006, 21 (07) : 655 - 677
  • [5] Cross-Modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings
    Carvalho, Micael
    Cadene, Remi
    Picard, David
    Soulier, Laure
    Thome, Nicolas
    Cord, Matthieu
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 35 - 44
  • [6] U-BERT for Fast and Scalable Text-Image Retrieval
    Yu, Tan
    Fei, Hongliang
    Li, Ping
    PROCEEDINGS OF THE 2022 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2022, 2022, : 103 - 113
  • [7] Multimodal Deep Learning Framework for Sentiment Analysis from Text-Image Web Data
    Thuseethan, Selvarajah
    Janarthan, Sivasubramaniam
    Rajasegarar, Sutharshan
    Kumari, Priya
    Yearwood, John
    2020 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2020), 2020, : 267 - 274
  • [8] Inflate and Shrink: Enriching and Reducing Interactions for Fast Text-Image Retrieval
    Liu, Haoliang
    Yu, Tan
    Li, Ping
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 9796 - 9809
  • [9] Improving text-image cross-modal retrieval with contrastive loss
    Chumeng Zhang
    Yue Yang
    Junbo Guo
    Guoqing Jin
    Dan Song
    An An Liu
    Multimedia Systems, 2023, 29 : 569 - 575
  • [10] Enhancing Text-Image Person Retrieval Through Nuances Varied Sample
    Xia, Jiaer
    Yang, Haozhe
    Zhang, Yan
    Dai, Pingyang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 185 - 196