Concept-based and embedding-based models in lifelog retrieval: an empirical comparison of performance

Authors
Nguyen, Manh-Duy [1]
Nguyen, Binh T. [2, 3]
Gurrin, Cathal [1]
Affiliations
[1] Dublin City Univ, Sch Comp, Dublin, Ireland
[2] Univ Sci, Ho Chi Minh City, Vietnam
[3] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
Funding
Science Foundation Ireland
Keywords
Lifelog; Interactive retrieval system; Multimodal retrieval; Fusion model
DOI
10.1007/s13735-025-00359-7
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Many lifelog retrieval systems have been introduced, each applying a different approach in its search engine. The traditional method matches concepts, which are visual objects detected in images, with semantic queries. This concept-based approach has been applied in many retrieval systems and has achieved top performance in lifelog search challenges. More recently, many novel embedding-based cross-modality retrieval models, such as CLIP, BLIP, and HADA, have been developed and have obtained state-of-the-art (SOTA) results in the image-text retrieval task. These models have also been applied in several lifelog search challenges. However, there is no comprehensive comparison between the two approaches, since many benchmarking evaluations contain bias factors such as the different user interfaces of the participating lifelog retrieval systems. In this paper, we conducted unbiased experiments in both automatic (non-interactive) and interactive configurations to evaluate the performance of many SOTA retrieval models, alongside the traditional concept-based approach, in the lifelog retrieval task. Furthermore, we retrained the models on a lifelog Q&A dataset to assess whether retraining on a small lifelog dataset could improve performance. The results showed that embedding-based search engines outperformed the concept-based approach by a large margin in both settings. This finding opens the opportunity to apply embedding-based models as a new generation of lifelog retrieval models in place of the conventional concept-based approach. The source code and detailed results are available online at https://github.com/m2man/Comparing-models-in-Lifelog-Retrieval-Task.
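As a rough illustration of the two retrieval styles the abstract contrasts, the following minimal sketch ranks three toy images first by exact concept matching and then by cosine similarity between embeddings. It is not taken from the paper or its repository: the image names, concept labels, and three-dimensional vectors are invented for illustration, and the vectors merely stand in for what a model such as CLIP might produce.

```python
import math

# Hypothetical toy index: detected concept labels plus a made-up embedding per image.
IMAGES = {
    "img_001": {"concepts": {"coffee", "laptop", "desk"}, "embedding": [0.9, 0.1, 0.2]},
    "img_002": {"concepts": {"car", "road", "tree"}, "embedding": [0.1, 0.9, 0.3]},
    "img_003": {"concepts": {"cup", "table", "window"}, "embedding": [0.8, 0.2, 0.4]},
}


def concept_score(query_terms, concepts):
    """Fraction of query terms that exactly match a detected concept."""
    if not query_terms:
        return 0.0
    return len(query_terms & concepts) / len(query_terms)


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank(score_fn):
    """Return image names sorted by descending score."""
    return sorted(IMAGES, key=lambda name: score_fn(IMAGES[name]), reverse=True)


# Query "coffee on a desk": keywords for concept matching, and an assumed
# text embedding (a stand-in for a real text encoder's output).
query_terms = {"coffee", "desk"}
query_embedding = [1.0, 0.0, 0.2]

concept_ranking = rank(lambda img: concept_score(query_terms, img["concepts"]))
embedding_ranking = rank(lambda img: cosine(query_embedding, img["embedding"]))

print("concept-based:  ", concept_ranking)
print("embedding-based:", embedding_ranking)
```

In this toy setup, exact concept matching gives img_003 (a cup on a table) a score of zero because none of its labels equals a query term, whereas the embedding ranking places it second because its vector lies close to the query vector. Real systems differ in many details; the sketch only shows the shape of the comparison.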
Pages: 9